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CommNexi^ 

Presents 


The LiSBUS j Async I/O System 

A solution of the 1990’s for today’s data transmission problems. 



RS-232C ' 

l 


Outstandingly Simple and Reliable because LiSBUS"" is 

based on a breakthrough technology which uses the impe¬ 
dance of the bus cable to replace binary addresses. Conse¬ 
quently, data transmission management is greatly simpli¬ 
fied and much more reliable than today's equivalent sys¬ 
tems which require expensive sofware, hardware, and 
personnel investments. 


LiSBUS*": 

A product of our new CommNcxus"" line of communication systems. 


combined into one product without spending a much higher 
amount. The Starter Pack includes all the user needs to 
connect four peripherals and a complete set of LiSBUS*" 1 
Development Tools to create custom applications. 


Outstandingly Practical because is it easy to install and 
easy to operate. No special tools, workbench, or electronics 
expertise are needed. Anyone can be up and running in 
minutes. Just plug in the external modules and configure 
the system with the user-friendly LiSBUS tm Link Control 
Software. Each external module measures only around 
2in. by 2in. 

Outstandingly Flexible because a user can connect up to 
60 peripherals or computers to a controlling computer 
through their RS-232C (COM) ports. To add peripherals, 
just extend the bus cable and add modules. 

At an Unbeatable Price because at $650* for the LiSBUS"” 
Starter Pack, no alternative can offer all these advantages 


Special Pre-release Offer: you can receive 
the LiSBUS' 1 " Starter Pack at the special pre¬ 
release price of $550. Just call or mail in your 
order by February 29, 1992, at the latest. De¬ 
liveries will begin early March. 


For more information and ordering contact: 

In the USA and Canada: In Europe: 


Toll Free (800) 945-3002 

Mon.-Fri. 9am - 9pm EST 

GIGATEC (USA). Inc. 

871 Islington Street 
Portsmouth, NH 03801 USA 
Tel. (603) 433-2227 
Fax (603) 433-5552 


















NEW OBJECT-ORIENTED LANGUAGE 



After you’ve tried new MODSIM II 
other object-oriented languages will seem primitive 

Free trial and, if you act now, free training 


M ODSIM II is a modern 

high-level, object-oriented 
language with these important 
benefits: 

• Object-oriented programming. 

The powerful mechanisms that 
help you develop structured, 
easily maintainable code. 

• A compact, readable language. 
Your programs are at least 25% 
smaller than equivalent C + + 
programs and they are easier to 
read. 

• Symbolic debugger. Error detec¬ 
tion and correction are simplified 
through steptracing, multiple 
breakpoints, and data viewing. 

• Graphics and animation. You 
easily lay out your graphical in¬ 
terface with no programming. 

• Standardization across com¬ 
puters. Your development invest¬ 
ment is preserved when you 
change computer types. 

• Simulation. All the features you 
need to do simulation and auto¬ 
matically collect statistics are 
available if you need them. 

With these benefits, the time and 
cost you need to develop programs is 
sharply reduced. 


Free trial offer 

The free trial contains everything 
you need to try MODSIM II on your 
computer. Try the language, the 
quality and the timeliness of our sup¬ 
port, and our documentation and 
training-everything you need for a 
successful project. 

We’ve been providing software 
and support for 29 years-we’ll be 
here when you need us. 

Computers with MODSIM II 

MODSIM II® is available for all 
popular computers. 

Charter User Group benefits 

CACI is now organizing a MOD¬ 
SIM II Charter User Group. Mem¬ 
bers receive a reduced price, early 
releases, and other benefits. 

For a limited time we also include 
free training. Call today to avoid 
disappointment-class size is limited. 

For immediate information 

In the US, call Hal Duncan at (619) 
457-9681, or Fax (619) 457-1184. In 
Canada, call Peter Holt on (613) 
782-2474, Fax (613) 782-2202. In 
Europe, call Nigel McNamara, in the 
UK, on 044 276 671 671, Fax 044 276 
670 677. 


Rush information on 
MODSIM II. 


Charter User Group offer-return the 
coupon today and we will include a 
free course worth $950. 
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□ Send details on your University Offer 


| Return to: Mi| 

CACI Products Company 
3344 North Torrey Pines Court 
La Jolla, California 92037 

Call Hal Duncan at (619) 457-9681 
Fax (619) 457-1184 
In Canada: 

I CACI Products Division 
1 200-440 Laurier Avenue West 
I Ottawa, Ontario KIR 7X6 

I Call Peter Holt on (613) 782-2474 
I Fax (613) 782-2202 
In Europe: 

| CACI Products Division 
I Suite 11, Coliseum Business Centre 
Watchmoor Park, Riverside Way 
Camberley, Surrey GU15 3YL, UK 

Call Nigel McNamara on 044 276 671 671 
Fax 044 276 670 677 


MODSIM II is a registered trademark and 
service mark of CACI Products Company 
® 1992 CACI Products Company 


□ Yes, I want to learn the reasons for 
the growing popularity of MODSIM II. 
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Computer Society 
President’s 


MESSAGE 


Getting started 


At its November meeting in Nash¬ 
ville, the IEEE Computer Society’s 
Board of Governors adopted the stra¬ 
tegic plan developed by the 1991 Plan¬ 
ning Committee ( Computer , Nov. 

1991, pp. 4-5, and Dec. 1991, p. 81). 
The plan is a “baseline”; it provides a 
departure point for mapping out and 
navigating toward the society’s future. 
Without a strategic plan in place, or¬ 
ganizations as large and complex as 
ours tend to evolve in an incoherent 
and inconsistent manner — a nonopti- 
mal situation that would not allow us 
to maximize the value of the products 
and services we provide our members 
or to minimize the costs of providing 
them. Among other things, the plan 
establishes a process by which the so¬ 
ciety will undertake an annual exami¬ 
nation of its goals and a comprehen¬ 
sive analysis of the specific objectives, 
actions, and programs in place to 
achieve those goals. 

The 1992 Executive Committee and 
the 1992 Board of Governors are 
committed to enabling the wide vari¬ 
ety of processes and initiatives devel¬ 
oped in the plan. Many of these initia¬ 
tives encourage and support the 
potential synergy among our pro¬ 
grams and will take a number of years 
to become effective. More than ever 
before in our history, we must work 
closely together, stressing the rela¬ 
tionships and dependencies of our 
programs rather than treating them as 
disjoint, independent, or even com¬ 
peting activities. The presentations 
and discussions at the 1992 Executive 
Committee planning meeting in San 
Diego in December clearly indicated 
that the seven 1992 vice presidents 
and the other society officers and se¬ 
nior staff intend to work together as a 
team to help bring this about. 

It is obvious to those who have read 
the plan that it was a complex under¬ 
taking requiring significant coordina¬ 


tion; scores of volunteers and staff 
were involved. The document is some¬ 
what lengthy — 118 pages, of which 
40 are proposed changes to either the 
bylaws or the Policies and Procedures 
manual. However, there are surpris¬ 
ingly few guiding principles applied 
throughout. These guiding principles 
were synthesized from a detailed anal¬ 
ysis of the nine task force reports 
commissioned by the Planning Com¬ 
mittee. They are 

(1) Value to members. The society 
must be of value to its members. It is 
our aim to be the leading technical 
and professional society in the field. 
We must provide our members and 
the profession with products and ser¬ 
vices of the highest value at the lowest 
possible cost. We need to continually 
evaluate the environment, services, 
and products we provide to make sure 
they are responsive to the needs of 
our existing and potential members 
and are delivered with care and atten¬ 
tion to quality. We are, after all, a ser¬ 
vice-oriented, membership-based 
technical society. We must know sig¬ 
nificantly more about ourselves, our 
competitors, our potential members, 
and the worldwide computer industry 
to be of value to our members. 

(2) A rewarding environment. The 
society must provide a professionally 
and personally rewarding environ¬ 
ment for its volunteers and staff. All 
of our jobs must be enjoyable; they 
must be made more “do-able”; we can 
not and should not overwhelm our¬ 
selves, our staff, and our members 
with unneeded administrative struc¬ 
tures, committees, rules, forms, and 
procedures that are barriers to volun¬ 
teer participation. We will be working 
to streamline our procedures and 
make them more flexible. 

(3) Technical activities. The soci¬ 
ety’s principal mechanism for main¬ 



taining its technical currency is its 
technical activities program executed 
through its technical committees. We 
need more TCs to respond to changes 
in our field. The formation of new 
TCs in multimedia systems, computa¬ 
tional science, and information sys¬ 
tems is already under discussion. Ad¬ 
ditionally, we need to enable and help 
create synergy among the TCs. New 
budget processes are being put in 
place to help us reach these goals. We 
need TC involvement and leadership 
in all of our major program activities, 
and we are developing a technical 
membership segment champion initia¬ 
tive to address this need. Future issues 
of Computer will carry more news 
about this initiative. 

(4) Policies and programs. The 
Board of Governors should establish a 
vision for the society and set its over¬ 
all guiding policies; the program 
boards should set program-specific 
policies and have oversight and men¬ 
toring functions; the program boards 
should have separate operations com¬ 
mittees that undertake and execute 
the programs. So that the Board of 
Governors can delegate more authori¬ 
ty to the program boards, the plan 
calls for the boards to be constituted 
as policy and oversight boards, inde¬ 
pendent of the persons or groups 
whose operations they oversee. The 
program boards’ primary responsibili¬ 
ty will be the continuing evaluation of 
the technical vitality of every program 
or service, and assessment of their 
continuing viability in terms of the 
needs of the membership and the pro¬ 
fession. Budgetary proposals of indi¬ 
vidual programs must be reviewed by 
the program boards. The program 
boards will assign priority and funding 
levels to the programs, balancing 
them against the budgetary targets as¬ 
signed to them by the Board of Gov¬ 
ernors. Most of the bylaws changes 
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published in the December issue 
of Computer (pp. 82-84) are directly 
related to this policy/oversight em¬ 
powerment. 

Some additional areas that we’ll be 
working on in this coming year are 

(1) Self-governance within the soci¬ 
ety. The plan recognizes that each lev¬ 
el in the volunteer hierarchy should 
perform the functions appropriate to 
that level, seeking neither to usurp the 
responsibilities of higher bodies nor to 
micromanage the work of subordinate 
bodies. It also adopts the mandate 
that each board’s volunteer leaders be 
selected directly by the groups they 
serve. 

(2) Volunteer and staff resources. 

We will work to minimize the paper 
flow and administrative overheads, 
and to make all our jobs easier and 
enjoyable. We believe this will en¬ 
courage greater involvement by mem¬ 
bers as volunteer leaders in all of our 
activities. We need to continually 


evaluate our staff resources to make 
sure they are adequate and appropri¬ 
ate to the world’s leading society of 
computing professionals. 

(3) Having timely and useful data 
available for decision making. This ob¬ 
vious need was addressed by Past 
President Duncan Lawrie ( Computer , 
Oct. 1991, pp. 4-5), by the staffing 
plan that the Board of Governors 
adopted in Nashville, and by the stra¬ 
tegic plan. The board approved a new 
staff position devoted to coordinating 
and acquiring such data for us. 

(4) Fiscal responsibility. We must 
continue to build the society’s finan¬ 
cial reserves. We will continue to be 
vigilant in budget-related issues — es¬ 
pecially in the face of what seems to 
be an unending recession that has hit 
the international computer industry 
particularly hard. Each of our pro¬ 
gram boards must conduct its work 
with less funding than in 1991. We 
must make intelligent use of these 
funds, maximizing what we do with 
them. Given fewer dollars to support 


travel and less time for meetings, we 
will try to conduct as much of our 
work as possible via electronic mail, 
teleconferences, and fax. I have asked 
all of the senior volunteers and staff 
to pay careful attention to their bud¬ 
gets on a regular basis and to try to 
cut expenses for the society in every 
way it is prudent to do so without im¬ 
pacting the quality of the products 
and services we offer. 

Our work is far from over. The 
baseline plan will take several years to 
roll out. It will require the commit¬ 
ment and involvement of several suc¬ 
cessive Boards of Governors and sev¬ 
eral Executive Committees, and 
increasing involvement by the mem¬ 
bership to help us maintain a vibrant, 
relevant society. I look forward to 
working with you to make this hap¬ 
pen. 


Bruce D. Shriver 
Computer Society president 
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INTERNATIONAL CONFERENCE ON FIFTH GENERATION COMPUTER SYSTEMS 1992 
JUNE 1 (Mon.)-5 (Fri.), 1992 TOKYO, JAPAN 

The FGCS Conference, with three predecessors held in 1981, 1984 and 1988. It has received high 
ratings as an event that provides an opportunity for many distinguished scholars, top-class research 
engineers, and high caliber business and government people to gather from all over the world to report 
research results and exchange opinions. We look forward to your participation in the conference. 


GENERAL INFORMATION 

HOST: 

Institute for New Generation Computer Technology 
(ICOT) 

SUPPORT: 

Ministry of International Trade and Industry 
COOPERATION: 

• Information Processing Society of Japan 

• The Institute of Electronics, Information and 
Communication Engineers 

• Japanese Society for Artificial Intelligence 

• Japan Society for Software Science and Technology 

• IEEE Computer Society @ 

• The Association for Computing Machinery aCITI 

• The Association for Logic Programming 

DEMONSTRATION: 

The final research results of the Japanese FGCS project 
will be demonstrated. 

REGISTRATION FEE: 

¥90,000 by March 31, 1992 
¥110,000 after April 1, 1992 

APPLICATION: 

Those who wish to make a registration for the confer¬ 
ence, please request Registration Form to the Secre¬ 
tariat. 

ORGANIZATION: 

Conference Committee 
Conference Chairperson: Hidehiko Tanaka, 

Univ. of Tokyo 

Vice-Chairperson: Koichi Furukawa, ICOT 
Program Committee 

Chairperson: Hozumi Tanaka, Tokyo Institute of 
Technology 

The Program Committee members are composed of 66 
members from 8 countries. 


FGCS'92 Secretariat: 


CONFERENCE SCHEDULE 

PLENARY SESSIONS Jun. 1-Jun. 2 

• OPENING SESSION 

• INVITED LECTURES 
Erich Bloch 

(Former Director of National Science Foundation, 
U.S.A.) 

J.A. Robinson 
(Syracuse Univ., U.S.A.) 

• REPORT ON ICOT RESEARCH RESULTS 
TECHNICAL SESSIONS Jun. 3^Jun. 5 

• INVITED LECTURE 
C.A.R. Hoare 
(Oxford Univ., U.K.) 

• INVITED PAPERS 

Raymond Reiter (Univ. of Toronto, Canada) 

William Dally (MIT, U.S.A.) 

Ivan Bratko (Josef Stefan Institute, Yugoslavia) 

• PRESENTATION of about 90 papers selected from 
256 submitted papers 

Foundations 

Architectures & Software 
Applications & Social Impacts 
ICOT Research Topics 

• PANEL DISCUSSION 

A springboard for information processing in the 21st 
century 

• DEMONSTRATIONS 

Parallel inference machine (PIM) 

Basic software 

Knowledge processing software 
Experimental parallel application software 


Institute for New Generation Computer Technology (ICOT) 

Mita Kokusai Bldg. 21F, 1-4-28 Mita, Minato-ku, Tokyo 108, Japan 
Phone: +81-3-3456-3195 E-mail: fgcs92@icot.or.jp 
Fax: +81-3-3456-1618 Telex: 32964 ICOT J 













Guest Editors’ 
Introduction 


Parallel 
Processing for 
Computer 
Vision and 
Image 

Understanding 



Alok Choudhary and Sanjay Ranka, Syracuse University 


V ision has long fascinated re¬ 
searchers from such disciplines 
as psychology, neural science, 
computer science, and engineering. 1 
What exactly is vision? It has been de¬ 
fined as the process of recognizing ob¬ 
jects of interest from images. The word 
process refers to some form of process¬ 
ing performed on the input data, which 
may be one or more images. The phrase 
objects of interest implies a context within 
which this processing takes place, as 
well as the presence of a representation 
used in this processing. For example, if 
we were asked, Is there a table in this 
room? we would compare the represen¬ 
tation of a table contained in our minds 


with something similar to it in the room. 
In that process, we would ignore objects 
that did not look like a table. That is, we 
start with a model and look for some 
instance of the model in the room. On 
the other hand, if we were told, “De¬ 
scribe all the objects in this room,” we 
would scan the room, form representa¬ 
tions of the objects, and match them to 
objects in our “knowledge bases.” How¬ 
ever, we may or may not know what to 
expect in the room. Both problems are 
vision problems. 

A general vision problem is consid¬ 
ered to be an ill-posed problem from a 
computational perspective. The ultimate 
achievement of computer vision will be 


to perform the tasks normally per¬ 
formed by human vision. 

The role of parallel 
processing 

Problems in computer vision are com¬ 
putationally intensive. Consider a se¬ 
quence of images at medium resolution 
(512 x 512 pixels) and standard frame 
rate (30 frames per second) in color (3 
J bytes per pixel). 2 This represents a rate 
of almost 24 million bytes of data per 
' second. A simple feature-extraction al¬ 
gorithm may require thousands of basic 
operations per pixel, and a typical vi- 
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Computer performance 
in billions of operations 
per second 


Grand Challenges 


Climate modeling 
Fluid turbulence 
Pollution dispersion 
Human genome 
Ocean circulation 
Quantum chromodynamics 
Semiconductor modeling 
Superconductor modeling 
Combustion systems 
Vision and cognition 


Structural ' 
biology 
Pharmaceutical i 
design i 


Figure 1. Performance requirements for “Grand Challenge” problems. 4 


Levels of processing 

Low. This level is normally 
termed bottom-up processing. 2 
Most image-processing operations 
fall into this category. Input data in¬ 
cludes images or simple transfor¬ 
mations of images. Computations 
in low-level processing are regular, 
exhibit high spatial parallelism, and 
mainly involve numeric processing. 
These computations are well suited 
for both SIMD (single instruction, 
multiple data) and MIMD (multiple 
instruction, multiple data) architec¬ 
tures. Example algorithms include 
edge detection, filtering operations, 
and connected component labeling. 

Intermediate. This category con¬ 
veniently bridges bottom-up (low- 
level) and top-down (high-level) 
processing. 2 Computations in this 
category manipulate symbolic and 
numeric data. Examples of symbol¬ 
ic data include edges and lines 
commonly referred to as tokens. 
Processing on this level attempts to 
build a coalition of tokens to extract 
meaningful entities, for example, 
forming rectangles from lines. 
Computations are normally data 
dependent and irregular. They are 
suitable for medium- to coarse- 
grain parallelism in MIMD mode, al¬ 
though a subset of computations 
also can be performed efficiently 
on SIMD architectures. 

High. Tasks on this level are nor¬ 
mally top-down (or model-directed). 
However, processing is not so well 
defined as on the other two levels. 
The model of the “world” drives the 
processing. The world represents a 
database of objects, their possible 
poses, and interrelationships in a 
context. For example, a model of a 
car has descriptions of wheels, 
doors, etc., and constraints de¬ 
scribing their relationships. Further¬ 
more, processing in this domain 
may require reexecuting algorithms 
from the other two levels. Although 
parallelism on computations at this 
level is not well understood, it is 
believed necessary to dynamically 
schedule computations. The diver¬ 
sity and highly data-dependent na¬ 
ture of computations make this lev¬ 
el of processing largely suitable for 
MIMD parallelism. 


sion system requires significantly more 
complex computations. As you can see, 
parallel computing is essential to solv¬ 
ing such a problem. 

In fact, the need to speed up image- 
processing computations brought par¬ 
allel processing into the computer vi¬ 
sion domain. Most image-processing 
algorithms are inherently parallel be¬ 
cause they involve similar computations 
for all pixels in an image. This has in¬ 
spired the development of array pro¬ 
cessors. For example, NASA Goddard 
Center’s MPP 3 has a 128 x 128 processor 
mesh-connected array specifically built 
for image processing. 

A mesh architecture connects each 
processor to four neighboring proces¬ 
sors: north, south, east, and west. It is 
suitable for image processing because 
its structure closely mimics that of a 
two-dimensional image. It also provides 
an efficient local communication struc¬ 
ture. 

However, the lack of efficient global 
communication capabilities in a mesh 


architecture — and the requirement for 
fast top-down and bottom-up image pro¬ 
cessing — led researchers to propose 
another architecture known as a “pyra¬ 
mid.” This architecture normally con¬ 
sists of several levels of meshes in which 
the top level has one processor and each 
succeeding level has four times as many 
processors as its parent array. In addi¬ 
tion to the mesh interconnections with¬ 
in each level, each processor is also con¬ 
nected to its four children (except in the 
bottom layer) and to its parent (except 
for the root). Therefore, a pyramid ar¬ 
chitecture maintains several levels of 
image representations simultaneously. 
Both mesh and pyramid architectures 
have contributed significantly to the 
understanding and development of new 
algorithms for image analysis and vi¬ 
sion, and have considerably influenced 
the subsequent designs of parallel ar¬ 
chitectures for vision. 

Current status. Parallel processing has 
taken tremendous strides in the last 
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decade, enabling scientists to perform 
very large scientific computations that 
were impractical a few years ago. But 
the immense computational challenge 
presented by vision is still to be met. In 
fact, compared to the impact of parallel 
processing in other areas, its impact in 
the vision domain has been minimal for 
several reasons. 

A typical vision system requires inte¬ 
grating algorithms from diverse areas 
such as image processing, numerical 
analysis, graph theory, artificial intelli¬ 
gence, and databases. There is no clear 
understanding and consensus on how 
to achieve this. Specific problems in 
integration can also be attributed to a 
lack of understanding of the vision pro¬ 
cess itself, even if the computations 
and parallelism of some individual com¬ 
ponents are well understood. 

Currently, the dominant approach 
to characterizing vision computations 
is to classify the processing require¬ 
ments into three levels: low, intermedi¬ 
ate, and high. The most recent image¬ 
understanding benchmark 2 embodies 
this characterization. The accompany¬ 
ing sidebar presents a brief overview of 
each level. 

Recent efforts in architectural de¬ 
sign and development have embedded 
architectural components for each lev¬ 
el of processing into one integrated 
architecture (as explained in the side- 
bar). Compared to the progress in ar¬ 
chitectural advances in general-purpose 
parallel processing for other scientific 
disciplines, however, architectural ad¬ 
vances for vision systems are in their 
infancy. 

Future directions 

Artificial vision is one of the “Grand 
Challenges” identified by the federal 
government’s High-Performance Com¬ 
puting and Communication (HPCC) 
initiative 4 (see Figure 1). Solving these 
problems is expected to require raw 
computational power between 100 and 
1,000 billions of operations per second. 
Besides raw computational needs, oth¬ 
er issues must be addressed to effi¬ 
ciently use parallel processing in solv¬ 
ing vision problems. 

Architectures. As discussed, vision 
problems span a broad spectrum of 
computations suitable for different 
types of architecture (like special-pur¬ 


pose, SIMD, and MIMD). The compu¬ 
tations in various stages of processing 
not only need to execute concurrently 
in a system but also must interact with 
each other. Hence, the future entails 
some form of heterogeneity in an ar¬ 
chitecture for vision. The challenge for 
researchers is to capitalize on the ad¬ 
vancements in multiprocessor technol¬ 
ogy and incorporate them optimally 
into an architecture suitable for vision 
problems. 


System integration. A straightforward 
integration of different types of archi¬ 
tectures into one organization may not 
be efficient for solving vision problems. 
The integration must incorporate ap¬ 
propriate control and communication 
structures suitable for providing the 
necessary interaction between various 
parts. It should also reflect the pecu¬ 
liarities of vision systems. In simple 
terms, an integrated organization should 
not be visible to a user as a collection of 
heterogeneous components but should 
provide a unified view of a machine in 
which heterogeneity is transparent to 
the user. 


Programming models and software 
tools. Development of these items has 
lagged far behind progress made in ar¬ 
chitectures. Vision-processing require¬ 
ments present a greater challenge be¬ 
cause vision requires programming 
models and subsequent languages pow¬ 
erful and flexible enough to handle dif¬ 
ferent types of processing while hiding 
the details of a heterogeneous system. 

Software tools are necessary to pro¬ 
vide users a friendly platform on which 
they can develop applications. If users 
are burdened with the responsibility of 
learning machine organization intimate¬ 
ly in order to use these tools, the gains 
from parallel processing will be limit¬ 
ed. Tools are also important for users 
to rapidly prototype and experiment 
with their algorithms. This is critical to 
developing and testing new techniques 
to solve problems in the vision domain. 

Real-time vision. Examples of real¬ 
time vision applications include robot¬ 
ics and autonomous land vehicles. Such 
applications not only demand high per¬ 
formance but also require predictable 
performance and a capability to inter¬ 
act with the environment (as in active 
vision) through efficient interface with 


sensors. They also place physical con¬ 
straints such as size, power consump¬ 
tion, cost, and robustness on a parallel 
system. Therefore, special-purpose par¬ 
allel computing may yet have a role in 
such systems. ■ 
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Multimodel Parallel Programming 
for Animate Vision 
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Checkers, a complex 
real-time application, 
demonstrates the 
advantages of 
decomposing animate 
vision systems by 
function and 
independently selecting 
an appropriate 
p arallel-programming 
model for each 
function. 


V ision can be viewed as a passive, observational activity, or as one intimate 
ly related to action (for example, manipulation, navigation). In passive 
i vision systems the camera providing the image input is immobile. In 
active vision systems observer-controlled input sensors are used. 1 Active vision 
results in much simpler and more robust vision algorithms, as outlined in Table 1. 

Another dimension for classifying computer vision approaches is reconstruc¬ 
tive versus animate. In the reconstructionist or general-purpose paradigm, the 
vision task is to reconstruct physical scene parameters from image input, to 
segment the image into meaningful parts, and ultimately to describe the visual 
input in such a way that higher level systems can act on the descriptions to 
accomplish general tasks. During the last decade, substantial progress in recon¬ 
structionist vision has been made using both passive and active systems that 
exploit physical and geometric constraints inherent in the imaging process. 
However, reconstructionist vision appears to be nearing its limits without reach¬ 
ing its goal. 

An alternative to reconstructionist vision derives from the observation that 
biological systems do not, in general, perform goal-free, consequence-free vi¬ 
sion. 2 This observation suggests that vision may, of necessity, be a more interac¬ 
tive, dynamic, and task-oriented process than is assumed in the reconstructionist 
approach. Animate vision researchers, inspired by successful biological systems, 
seek to develop practical, deployable vision systems by discovering and exploiting 
principles that link perception and action. Animate systems use active vision and 
are structured as vertically integrated skills or behaviors, rather than as visual 
modules that try to reconstruct different aspects of the physical world. 

Despite the computational simplifications of the animate vision paradigm, a 
parallel implementation is necessary to achieve the required performance. Fortu¬ 
nately, many of the tasks in an animate vision system are inherently parallel. 
Inputs from multiple sensors can be processed in parallel. Low-level-vision 
algorithms are intensely data parallel. Planning and strategy algorithms frequent¬ 
ly search a large state space, which can be decomposed into smaller spaces that are 
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searched in parallel. Thus, finding par¬ 
allelism in the application is easy. How¬ 
ever, the type of parallelism we would 
like to exploit varies among tasks in the 
system, and no single model of parallel 
computation is likely to suffice for all 
tasks. 

The difficulty arises because parallel¬ 
ism can be applied in many ways, using 
different programming constructs, lan¬ 
guages, and runtime libraries to express 
it. Each environment can be character¬ 
ized by the process model it provides: 
the abstraction for the expression and 
control of parallelism. The process model 
typically restricts the granularity of com¬ 
putation that can be efficiently encap¬ 
sulated within a process, the frequency 
and type of synchronization, and the 
form of communication between pro¬ 
cesses. A typical animate vision appli¬ 
cation will likely consist of many tasks. 
Each task may require a different paral¬ 
lel-programming model, and all must 
cooperate to achieve the desired behav¬ 
ior. These multimodel programs require 
an underlying software system that sup¬ 
ports several different models of paral¬ 
lel computation simultaneously and also 
allows tasks implemented in different 
models to interact. 

We believe that to exploit fully the 
parallelism inherent in animate vision 
systems, an integrated vision architec¬ 
ture must support multiple models of 
parallelism. To support this claim, we 
first describe the hardware base of a 
typical animate vision laboratory and 
the software requirements of applica¬ 
tions. We then briefly overview the 
Psyche operating system, which we de¬ 
signed to support multimodel program¬ 
ming. Finally, we describe a complex 
animate vision application, Checkers, 
constructed as a multimodel program 
under Psyche. 

Architecture for 
animate vision systems 

Systems for animate vision have at 
least three components: sensor input, 
cognition, and action. The goal of our 
work is to provide mechanisms for the 
efficient integration of these components 
at both the hardware and software levels. 

Hardware environment. Animate vi¬ 
sion systems require movable, comput¬ 
er-configurable sensors, sophisticated 
effectors or mobile vehicles, and sever¬ 


al high-bandwidth computing devices. 
Our hardware currently consists of six 
key components: 

• a binocular head containing mov¬ 
able cameras for visual input, 

• a robot arm that supports and moves 
the head, 


• a special-purpose parallel proces¬ 
sor for high-bandwidth, low-level 
vision processing, 

•general-purpose MIMD (multiple 
instruction, multiple data) parallel 
processors, 

• a dextrous manipulator, and 

• a Data Glove input device. 


Table 1. Computational features of passive and active vision systems. 


Passive Vision 

Active Vision 

A fixed camera may not have 
an object in view. 

Active vision can use physical search, 
by navigation or manipulation, chang¬ 
ing intrinsic or extrinsic camera 
parameters. 

Static camera placement re¬ 
sults in nonlinear, ill-posed 
problems. 

Known, controlled camera movements 
and active knowledge of camera place¬ 
ment provide self-generated constraints 
that simplify processing. 

Stereo fusion is intractable. 

An actively verging system simplifies 
stereo matching. 

A single fixed camera imposes 
a single, possibly irrelevant, 
coordinate system. 

Active vision can generate and use exo- 
centric coordinate frames, which yield 
more robust quantitative and qualitative 
algorithms and serve as a basis for spatial 
memory. 

Fixed spatial resolution limits 
imaging effectiveness. 

Variable camera parameters can compen¬ 
sate for range, provide varying depth of 
field, and indirectly give information 
about the physical world. 

Segmentation of static, single 
images is a known intractable 
problem. 

Gaze control helps segmentation: Active 
vergence or object tracking can isolate 
visual phenomena in a small volume of 
space, simplifying grouping. 


Tenets of animate vision 


• Vision does not function in isolation, but is instead part of a complex behav¬ 
ioral system that interacts with the physical world. 

• General-purpose vision is a chimera. There are simply too many ways to 
combine image information and too much to know about the world for vision to 
construct a task-independent description. 

• Directed interaction with the physical world can permit information not readily 
available from static imagery to be obtained efficiently. 

• Vision is dynamic. Fast vision processing implies that the world can serve as 
its own database, with the system retrieving relevant information by directing 
gaze or attention. 

• Vision is adaptive. The functional characteristics of the system may change 
through interaction with the world. 
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Figure 1. In this configuration, the robot head has one 
large color camera and two small gray-scale cameras, a 
single tilt motor, twin pan motors, and a passively compli¬ 
ant checker-pushing tool. 


The head, shown in Fig¬ 
ure 1, has two movable gray¬ 
scale CCD (charge-coupled 
device) television cameras 
and a fixed color camera pro¬ 
viding input to a MaxVideo 
pipelined image-processing 
system. One motor controls 
the tilt angle of the two-eyed 
platform. Separate motors 
control each camera’s pan 
angle, providing indepen¬ 
dent vergence control. The 
controllers allow sophisticat¬ 
ed velocity and position com¬ 
mands and data readback. 

The robot body is a Puma 
761 six-degrees-of-freedom 
arm with a 2-meter-radius 
workspace and a top speed 
of about 1 meter per second. 

It is controlled by a dedicat¬ 
ed Digital Equipment Corp. 

LSI-11 computer imple¬ 
menting the proprietary Val 
execution monitor and pro¬ 
gramming interface. 

The MaxVideo system 
consists of several indepen¬ 
dent boards that can be connected to 
achieve a wide range of frame-rate im¬ 
age-analysis capabilities. The MaxVid¬ 
eo boards are all register programma¬ 
ble and can be controlled via a VMEbus. 
The Zebra and Zed programming sys¬ 
tems, developed at the University of 
Rochester, make this hardware easily 
and interactively programmable. 

An important feature of our labora¬ 
tory is the use of a shared-memory mul¬ 
tiprocessor as the central computing 
resource. Checkers, our animate vision 
application, uses a 32-node BBN But¬ 
terfly Plus parallel processor. Each node 
contains a Motorola MC68020 proces¬ 
sor with floating-point hardware and 4 
Mbytes of memory. The Butterfly is a 
shared-memory multiprocessor. Each 
processor can directly access any mem¬ 
ory in the system, although local mem¬ 
ory is roughly 12 times faster to access 
than nonlocal memory. The Butterfly 
has a VMEbus connection that mounts 
in the same card cage as the MaxVideo 
and motor controller boards, replacing 
a processor in the physical address space 
of the multiprocessor. The Butterfly also 
has a serial port on each board. We use 
the port to communicate directly with 
the Val robot controller software on its 
dedicated LSI-11. A Sun 4/330 worksta¬ 
tion acts as a host for the system. 


Several components have only recent¬ 
ly been installed in our laboratory and 
therefore were not used in Checkers. 
These include an array of eight trans¬ 
puters for real-time control, a 16-de- 
grees-of-freedom Utah hand, and a Data 
Glove used to gather manipulation data 
from humans and for teleoperation of 
the Utah hand. 

Software requirements. Animate vi¬ 
sion systems are inherently parallel. The 
hardware devices they use provide one 
source of parallelism. The algorithms 
used for device control and for combin¬ 
ing perception and action provide an¬ 
other source. The real issue is how to 
harness the application’s inherent par¬ 
allelism without being overwhelmed by 
the complexity of the resulting software. 
Our experiences with two DARPA 
benchmarks for parallel computer vi¬ 
sion 3 illustrate the utility of multiple 
parallel-programming environments for 
implementing computer vision algo¬ 
rithms, and the difficulty of successfully 
integrating the components of an ani¬ 
mate vision system. 

The first benchmark contains a suite 
of noninteracting routines for low- and 
high-level-vision tasks. The low-level- 
vision routines require manipulation of 
two-dimensional pixel arrays using data 


parallelism, conveniently ac¬ 
complished using the SIMD 
(single instruction, multiple 
data) style of computation pro¬ 
vided by the Uniform System 
library from BBN. 4 The func¬ 
tions for high-level vision re¬ 
quire coarser grain parallel¬ 
ism and communication, for 
which we used two parallel¬ 
programming environments 
developed at Rochester: a 
message-passing library and a 
parallel-programming lan¬ 
guage (Lynx 5 ). Each environ¬ 
ment made programming a 
particular application easier 
in some way. 

The second benchmark calls 
for an integrated scene-de¬ 
scribing system. This bench¬ 
mark emphasizes integration 
of several levels of image un¬ 
derstanding to describe a scene 
of polygons at various discrete 
depths. It thus underscores the 
usefulness of a unified ap¬ 
proach to multimodel paral¬ 
lelism. Unfortunately, we im¬ 
plemented the parts of our scene¬ 
describing system using several differ¬ 
ent programming models, and we lacked 
the system support necessary to inte¬ 
grate the various models. Processes in 
our MIMD computations could not di¬ 
rectly access the data structures pro¬ 
duced by our SIMD computations, and 
processes of different types could not 
synchronize. Ironically, the very diver¬ 
sity that facilitated our success in the 
first benchmark prevented a successful 
implementation of the second. 

The DARPA benchmarks and our 
other applications experience showed 
the potential advantages of using a large- 
scale MIMD multiprocessor as the con¬ 
trolling architecture in integrated ani¬ 
mate vision systems. Our experiences 
also demonstrated the importance of 
matching each application, or parts of a 
large application, to an appropriate par¬ 
allel-programming environment, and the 
importance of integrating functions 
across environment boundaries. 

Multimodel 

programming in Psyche 

Psyche is a multiprocessor operating 
system designed to support multimodel 
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programming. 6 The abstractions provid¬ 
ed by the Psyche kernel allow user-level 
runtime libraries to implement custom¬ 
ized programming models, with each 
library retaining full control over how 
its processes are represented and sched¬ 
uled, and how they synchronize with the 
processes implemented by other librar¬ 
ies. 7 The kernel is responsible for coarse- 
grain resource allocation and protec¬ 
tion, while runtime libraries implement 
short-term scheduling and process man¬ 
agement. 

A multimodel program consists of a 
set of modules, each of which may im¬ 
plement a (potentially different) pro¬ 
gramming model. Each module defines 
a set of interface procedures used to 
access the code and data encapsulated 
by the module. In addition, each mod¬ 
ule that implements a programming 
model defines a set of process-manage¬ 
ment routines used to control the be¬ 
havior of the model’s processes. To com¬ 
municate between modules, and hence 
between different programming mod¬ 
els, processes call interface procedures, 
which may in turn call process-manage¬ 
ment routines. The two key aspects of 
this approach are 

•Procedural access to shared data. 
The interface procedures of a module 
implement a communication protocol. 
Shared data in the module represents 
the state of the protocol and is manipu¬ 
lated by the procedures defined in the 
interface. By invoking interface proce¬ 
dures, processes take on the communi¬ 
cation style provided by the called mod¬ 
ule. Interface procedures allow processes 
to gain access to arbitrarily complex 
styles of communication through a mech¬ 
anism — the procedure call — found in 
every model. 

• Dynamically bound process man¬ 
agement. The process-management rou¬ 
tines of a module allow interface proce¬ 
dures to tailor their behavior to different 
programming models. Interface proce¬ 
dures call process-management routines 
whenever they must create, destroy, 
block, or unblock a process. They need 
not embody assumptions about any par¬ 
ticular model. Among other things, pro¬ 
cess-management routines are required 
for scheduler-based synchronization. 

To facilitate the use of shared data 
and procedures, Psyche arranges for 
every module to have a unique system- 
wide virtual address. This uniform ad¬ 


dressing allows processes to share point¬ 
ers without worrying about whether they 
might refer to different data structures 
in different address spaces. It also al¬ 
lows processes to call interface proce¬ 
dures regardless of the current address 
space. Depending on the degree of pro¬ 
tection desired, a call to an interface 
procedure can be as fast as a normal 
procedure call ( optimized invocation), 
or as safe as a remote procedure call 
between heavyweight processes (pro¬ 
tected invocation). The two forms of 
invocation are initiated in exactly the 
same way in Psyche, with the native 
architecture’s jump-to-subroutine in¬ 
struction. In some cases this instruction 
generates a page fault, allowing the ker¬ 
nel to intervene. 

To implement a programming model, 
the user-level library must manage a set 
of virtual processors, which are kernel- 
provided abstractions of the physical 
processors. The kernel delivers a soft¬ 
ware interrupt to a virtual processor 
whenever it detects an event that might 
require the library to take immediate 
action. Events of this sort include pro¬ 
gram faults, the imminent end of a ker¬ 
nel-scheduling quantum, the need to 
block during a system call, and the initi¬ 
ation (via page fault) of a protected 
invocation. Data structures shared be¬ 
tween the kernel and the user (writable 
in part by the user) allow the library code 
to control the behavior of the software 
interrupt mechanism. They also provide 
a location in which to store the addresses 
of process-management routines. 

When creating a virtual processor, 
the user-level library specifies the loca¬ 
tion of a data structure describing that 
virtual processor. The kernel maintains 
a pointer to this descriptor (distinct on 
every physical processor) among the 
data structures shared between the ker¬ 
nel and the user. By convention, the 
descriptor in turn contains the address 
of a vector of pointers to process-man¬ 
agement routines. When multiprogram¬ 
ming virtual processors on top of a sin¬ 
gle physical processor, the kernel 
changes the pointer to the virtual pro¬ 
cessor descriptor on every context 
switch. When multiprogramming user- 
level processes on top of a single virtual 
processor (or on a collection of virtual 
processors), the user changes the ad¬ 
dress of the vector of process-manage¬ 
ment routines on every context switch. 
As a result, an interface procedure can 
always find the process-management 


routines of the currently executing pro¬ 
cess without knowing the origin or rep¬ 
resentation of the process. 

Using interface procedures and pro¬ 
cess-management routines. Psyche pro¬ 
grammers have developed two distinct 
idioms for interaction between dissimi¬ 
lar programming models. Both of these 
idioms appear in Checkers. In the first, 
a module encapsulates a passive shared 
data structure that is accessed by other 
modules containing different process 
types. In the second, a process from one 
model calls directly into a module that 
implements a different model. This sec¬ 
ond approach occurs both when calling 
an interface procedure directly and when 
implementing synchronization inside an 
interface procedure. 

When a process needs to wait for 
some condition while executing in an 
interface procedure, the code can fol¬ 
low pointers in well-known locations to 
find the addresses of the process-man¬ 
agement routines of the process’s pro¬ 
gramming model. It can then save the 
address of the unblock routine in a shared 
data structure and call the block rou¬ 
tine. Later, when another process es¬ 
tablishes the condition, it can retrieve 
the pointer to the unblock routine and 
call into the module that manages the 
waiting process, causing that process to 
unblock. 

When calling an interface procedure 
or process-management routine, a pro¬ 
cess must obey certain well-defined con¬ 
straints to avoid interfering with the 
correct operation of the host environ¬ 
ment. In the general case, it may be 
necessary to create a native process to 
obtain the full capabilities of that envi¬ 
ronment. 

Multimodel robot 
checkers player 

Checkers is a multimodel vision ap¬ 
plication implemented on top of Psyche. 
A checkers-playing robot conducts a 
game against a human opponent, cycli¬ 
cally sensing the opponent’s move and 
then planning and executing its response. 

An inexpensive, standard-size check¬ 
ers game is used. The camera’s internal 
parameters (aperture and focus) are 
manually adjusted before the game, and 
the external parameters (the exact posi¬ 
tions of the pan and tilt motors, and the 
robot’s position) are adjusted by an ini- 
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tial calibration procedure (finding pixel 
coordinates of the corners oftheboard). 

The normal rules of play are obeyed, 
including multiple captures, crowning, 
and the extended capabilities of kings. 
The robot pushes pieces around the 
board; it does not pick them up. During 
play the human player modifies the 
board by moving a piece. The sensing 
task detects the change in the board 
configuration and interprets it symbol¬ 
ically in terms of the primitive moves of 
checkers. In a symbolic, strategic task, 
the robot considers the change to be a 
potentially legal move by the human. 
The robot validates the move and, if it is 
valid, accepts it. Once the human makes 
a valid move, the robot runs a symbolic 
game-playing algorithm to find its re¬ 
sponse to the human move. The effec¬ 
tor task uses the board position report¬ 
ed by the vision subsystem and the 
computed symbolic move to plan a se¬ 
quence of primitive, physically realiz¬ 
able actions. When this movement plan 
is available, the robot arm is engaged to 
execute the plan. A board maintenance 
task provides central control, commu¬ 
nication, data representation, and syn¬ 
chronization. The robot emits status 
information, error messages, and occa¬ 
sional gratuitous remarks through a 
voice-synthesis board. A complete ro¬ 
bot move, including sensing and com¬ 
mentary, takes about 6 seconds. 

Parallel-programming environments. 

Checkers, like many animate vision ap¬ 
plications, consists of tasks to imple¬ 
ment sensing, planning, and action. We 
implemented each of these functions 
using a different parallel-programming 
environment: Multilisp, Lynx, the Uni¬ 
form System, or Uthread. 

The unit of parallelism in Multilisp is 
the future, 8 which is a handle for the 
future evaluation of an arbitrary S-ex- 
pression. Any attempt to reference a 
future before the value is determined 
causes the caller to block. These two 
mechanisms — parallel execution via 
futures and synchronization via refer¬ 
ences to futures — are used to build 
parallel Lisp programs. 

Lynx programs consist of multiple 
heavyweight processes, each with its own 
address space. 5 The processes exchange 
messages using named communication 
channels (links). Each heavyweight pro¬ 
cess consists of multiple lightweight 
threads of control that communicate 
using shared memory. Condition vari- 
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ables are used for synchronization be¬ 
tween threads in the same process. Syn¬ 
chronous message passing provides syn¬ 
chronization between threads in 
different processes. 

The Uniform System 4 is a shared- 
memory, data-parallel programming 
environment. Task generators create a 
potentially large number of parallel tasks, 
each of which operates on some portion 
of a large shared address space. Task 
descriptors are placed on a global FIFO 
work queue and are removed by proces¬ 
sors looking for work. Each task must 
run to completion, at which time another 
task is removed from the task queue. 
Spin locks are used for synchronization. 

Uthread is a simple, lightweight thread 
package that can be called from C++ 
programs. Uthread is the general-pur¬ 
pose programming environment of 
choice in Psyche and is frequently used 
to implement single-threaded servers. 

We chose these environments for four 
reasons: 

(1) Each was specifically developed 
for a particular application domain that 
was a subset of our problem domain. 

(2) Implementations of all four envi¬ 
ronments were either already available 
for our hardware or could be easily port¬ 
ed to our hardware. 

(3) The principals involved in the 
project had extensive experience with 
one or more of these implementations 
and would not have to learn a new sys¬ 
tem. 

(4) We already had a software base 
for vision, planning, and checkers play¬ 
ing, composed of programs written in 
the Uniform System, Lisp, and Lynx, 
respectively. 

Checkers tasks. The primary data 
structures used to implement Checkers 
are the representations of the board 
and the moves. A short pipeline of rep¬ 
resentations is needed to support back¬ 
ing up to legal or stable states. There are 
four different board representations: 
digitized image, calibration information, 
(x, y, z ) location of the piece centroids, 
and a symbolic description of the board. 
Each is used for different tasks. 

Three different representations for 
moves are used: the new board state 
that results from the move, a sequence 
of physical coordinates for the robot 
motion commands, and the list of par¬ 
tial moves (that is, a push or a sequence 
of jumps) needed to execute a move. 


These representations for the board 
and the moves are encapsulated in the 
board module, which provides synchro¬ 
nized access to the data structures and 
translation routines between the vari¬ 
ous representations. We implemented 
the board module using the Uthread 
package. A single thread of control is 
created to initialize the data structures, 
after which the module becomes a pas¬ 
sive data structure shared by tasks from 
other programming models. The board 
module synchronization routines use the 
Psyche conventions for process man¬ 
agement to implement semaphores that 
any model can call. 

Six different tasks cooperate to im¬ 
plement Checkers. Two manage the 
camera and robot devices; the remain¬ 
der implement vision, recognition of 
moves, checkers strategy, and motion 
planning. 

The camera manager is a Uthread 
module that maps and initializes a mem¬ 
ory segment for the VME memory used 
to control and access the MaxVideo 
hardware. This module registers the 
name and address of the memory seg¬ 
ment with a name server. The board 
interpreter (discussed below) accesses 
this segment directly to retrieve an im¬ 
age from the MaxVideo frame buffer. 
The frame buffer is filled at 30 Hz by the 
digitizer but is read out only when the 
board interpreter is ready to analyze 
another image. 

The board interpreter is a Uniform 
System program that transfers an image 
from the camera manager (in VME 
memory) to local Butterfly memory and 
produces a symbolic description of the 
checkers in the image. The data transfer 
of 0.25 Mbyte of image information over 
the VMEbus takes 280 milliseconds. 
After transferring the image, the board 
interpreter segments the image into 64 
board squares and analyzes each square 
in parallel. Each task attempts to deter¬ 
mine the color of its square, whether the 
square contains a piece, and, if so, the 
piece’s color. Each square is then la¬ 
beled according to the square’s color 
and the piece’s color. Piece centroids 
are calculated, as are centers of empty 
squares, in image and world coordinates. 
Once a complete interpretation con¬ 
taining no unrecognized squares is cal¬ 
culated, the board interpreter accepts 
the interpretation. If the new interpre¬ 
tation differs from the previous inter¬ 
pretation, the result is reported to the 
board module. Using four processors. 
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Figure 2. Functional modules and communication paths in Checkers. Multiple 
models of parallelism (to the right of the dotted line) are implemented under 
Psyche on the Butterfly. Perceptual and motor modules (to the left of the dot¬ 
ted line) reside on the Butterfly and in peripherals. 


the board interpreter can interpret the 
image input about once every second. 

The move recognizer is a Uthread 
module that compares two successive 
symbolic board interpretations produced 
by the board interpreter. It recursively 
decomposes the differences into a se¬ 
quence of legal partial moves (single 
jumps or moves) that transforms the 
first interpretation into the second. 

The checkers player is a game-playing 
program written in Lynx. It takes as 
input the list of partial moves describ¬ 
ing the human’s move and produces as 
output the list of partial moves to be 
made in response. A single multithread¬ 
ed master process manages the parallel 
evaluation of possible moves. Slave pro¬ 
cesses perform the work, implementing 
a parallel ex-(I game-tree search. 

The move planner is a trajectory cal¬ 
culation and planning program written 
in Multilisp. It constructs, in parallel, 
artificial potential fields that have peaks 
reflecting square occupancies and a glo¬ 
bal bias reflecting off-board goal loca¬ 
tions. For individual moves, the goal 
location is a particular square. When 
removing pieces, the goal location is 
one of eight goal areas off the board. 
The program considers these potential 
fields in parallel, using a local search 
procedure that yields a gradient-descent 
path along which a checker can be 
pushed. Since the algorithm allows pieces 
to be temporarily moved aside or 
swapped with the moving piece, it is a 
route-maker as well as a route-finder. 
The result is a set of plans. The algo¬ 
rithm chooses one plan on the basis of 
some cost function, such as the total 
estimated time to complete the move or 
the shortest distance to push the checker. 

The robot controller is a Uthread 
module that controls a serial line con¬ 
nection between the Butterfly and the 
Puma robot. The robot controller sends 
movement commands in the Val lan¬ 
guage (equivalent to MoveTo 
(X,Y,Z,SPEED)) and waits for notifi¬ 
cation of successful completion. 

Implementation of moves. Program 
execution is implemented as a series of 
moves, each of which requires the coop¬ 
eration of several modules and program¬ 
ming models. Lettered arrows in Figure 
2 show control flow among the modules. 

The board interpreter continuously 
receives an image from the camera (a, 
b) and analyzes it. When the board po¬ 
sition changes, the board interpreter 


invokes the board module (c) to update 
the board description, passing the sym¬ 
bolic and quantitative positions of the 
checkers. 

When the board module receives a 
new board position from the board in¬ 
terpreter, it invokes the move recogniz¬ 
er (d) to parse the difference between 
new and old board positions into partial 
checkers moves. These partial moves 
are stored in the board module to be 
retrieved by the checkers player. After 
a successful return from the move rec¬ 
ognizer, the original invocation from 
the board interpreter to the board mod¬ 
ule returns, causing the board inter¬ 
preter to resume evaluation of raw im¬ 
age data. 

When the invocation from the check¬ 
ers player to the board module (e) dis¬ 
covers that a new valid list of partial 
moves has appeared in the board mod¬ 
ule, it returns the first partial move to 
the checkers player module. If several 
partial moves are needed to complete 
the move, additional invocations from 
the checkers player to the board mod¬ 
ule (e) follow. If any partial move rep¬ 
resents an illegal move, the checkers 
player resets its internal state to the 


beginning of the move sequence and 
flushes the current state information 
and list of partial moves in the board 
module. It also synchronizes with the 
board interpreter (e, c), which informs 
the human and produces a new board 

As long as the incoming list of partial 
moves is legal, the checkers player will 
wait for moves to appear in the board 
module. As a result, board interpreta¬ 
tion can occur several times while the 
human makes a move, particularly if the 
move is a complex jump. The checkers 
player and board module interact (e) 
until a complete move is made. At this 
point the checkers player module runs 
and generates its reply to the human’s 
move in the form of a symbolic board 
position. This board position is passed 
to the board interpreter, which gener¬ 
ates a list of partial moves required to 
implement the differences between the 
updated board position and the current 
position. 

Once the board interpreter has pro¬ 
duced a list of partial moves that define 
the robot’s response, the checkers play¬ 
er invokes the move planner (f) with the 
partial move sequence. Partial moves 
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Table 2. Functional modules and source code statistics (number of lines) for 
Checkers. The runtime environment code and the Lynx game player were port¬ 
ed from existing systems; 4,482 lines of new code were written, including 2,902 
lines of application code and 1,580 lines of interface code. 


Function 

Model 

Application 

Runtime 

Interface 

Game player 

Lynx 

1,800 

8,838 

504 

Board interpreter 

Uniform System 

380 

8,057 

170 

Move planner 

Multilisp 

900 

13,127 

572 

Board module 

Uthread 

1,300 

2,436 

170 

Robot controller 

Uthread 

27 

2,436 

110 

Speech controller 

Uthread 

211 

2,436 

34 

Camera manager 

Uthread 

84 

2,436 

20 


are passed to the move planner one at a 
time, and each one causes a sequence of 
low-level move commands and acknowl¬ 
edgments to flow back and forth be¬ 
tween the move planner, the robot con¬ 
troller, and the robot (g, h). 

Intermodel communication. The im¬ 
plementation of a single move illustrates 
two distinct styles of interaction among 
programming models: data structures 
shared between models and direct pro¬ 
cedure calls (or invocations) between 
models. Both styles of interaction re¬ 
quire synchronization between processes 
of different types. 

The board module must synchronize 
access to data structures shared by pro¬ 
cesses from the Multilisp, Lynx, Uni¬ 
form System, and Uthread environ¬ 
ments. To access these data structures, 
processes call directly into the board 
module and execute the associated code. 
When a process must block within the 
board module, the code uses pointers 
provided by the kernel to find the cor¬ 
rect block and unblock routines for the 
currently executing process type. A pro¬ 
cess that must block on a semaphore 
first places the address of its unblock 
routine in the semaphore data structure 
and then calls its block routine. When 
another process wants to release a pro¬ 
cess that is blocked on a semaphore, it 
simply retrieves the address of the ap¬ 
propriate unblock routine from the 
semaphore data structure and calls the 
routine. If protection between process 
types is desired, the appropriate rights 
can be stored with the address of the 
routines, and protected invocations can 
be required. 

There are several advantages to com¬ 


municating between models via shared 
data structures: 

• Because we use a simple procedural 
interface to access shared data, there is 
a uniform interface between models, 
regardless of the number or type of 
programming models involved. 

• Communication is efficient because 
processes can use shared memory to 
communicate directly. 

•Synchronization across models is 
efficient because of the underlying mech¬ 
anisms for implementing synchroniza¬ 
tion (a kernel pointer to user-level pro- 
cess-management routines, and a 
procedural interface to routines that 
block and unblock a process). 

• The board module resembles a black¬ 
board communication structure, but we 
can use shared data abstractions be¬ 
tween models to implement a wide vari¬ 
ety of communication mechanisms, in¬ 
cluding message channels and mailboxes. 

A different type of interaction occurs 
between the checkers player and the 
move planner: A Lynx thread calls di¬ 
rectly into the Multilisp environment of 
the move planner. Since the move plan¬ 
ner already provides exactly the func¬ 
tionality required by the checkers play¬ 
er, an intervening data structure would 
simply add unnecessary generality and 
overhead (such as the cost of extra invo¬ 
cations). Instead, every entry point ex¬ 
ported by the move planner refers to a 
stub routine designed for invocation by 
processes outside the Multilisp world. 
This stub routine copies parameters into 
the Multilisp heap and dispatches a 
Multilisp future to execute the Lisp func¬ 
tion associated with the invocation. After 


the future executes the correct Mul¬ 
tilisp function, the Multilisp runtime 
environment calls the Lynx scheduler 
directly to unblock the Lynx thread. 

Direct calls between arbitrary envi¬ 
ronments are often complicated by the 
fact that the code in each environment 
makes many assumptions about the rep¬ 
resentation and scheduling of process¬ 
es. Psyche facilitates direct calls between 
modules by separating the code that 
depends on the semantics of processes 
from the code used as an external inter¬ 
face. As a result, an application like 
Checkers can be constructed from a 
collection of self-contained modules 
without regard to the programming 
model used within each module. 


C heckers demonstrates the advan¬ 
tages of decomposing animate 
vision systems by function and 
independently selecting an appropriate 
parallel-programming model for each 
function. By extending the well-known 
software engineering principle of mod¬ 
ularity to include different parallel-pro¬ 
gramming environments, we increase 
the expressive power, reusability, and 
efficiency of parallel-programming sys¬ 
tems and applications. These proper¬ 
ties add significantly to our ability to 
build complex animate vision applica¬ 
tions. 

The entire Checkers implementation 
required only two months of part-time 
effort by five people. Our use of multi¬ 
ple parallel-programming models was 
not an artificial constraint; it was a rea¬ 
soned choice based on the tasks to be 
performed, the expertise of the people 
involved, the available software, and 
the available programming environ¬ 
ments. 

We resurrected a Lynx checkers-play- 
ing program that had been implement¬ 
ed years ago as a stand-alone program. 
The Uniform System image-analysis li¬ 
brary was plugged into Checkers after 
several years of disuse. The board inter¬ 
preter, move planner, board module, 
and move recognizer, as well as neces¬ 
sary Psyche support for the particular 
models we used, were all developed si¬ 
multaneously by people who had exper¬ 
tise in a particular problem domain and 
the related software environment. Cod¬ 
ing these modules was a part-time activ¬ 
ity extending over several weeks. 

Integration was a full-time activity 
that took only a few days. During inte- 
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gration, we made (and subsequently 
changed) many decisions about which 
modules would communicate directly 
with each other, and which should use 
the shared data structures. Our experi¬ 
ences have convinced us of the impor¬ 
tance of integration through shared data 
abstractions and customized communi¬ 
cation protocols accessible from every 
parallel-programming model. Table 2 
shows the relatively small amount of 
coding we had to do to integrate the 
various Checkers subsystems. 

Our ability to build the stylized data 
abstractions and communication proto¬ 
cols used in Checkers suggests that we 
will have little difficulty experimenting 
with alternative communication proto¬ 
cols or processor assignments. This is 
precisely the type of flexibility required 
in animate vision systems, and our ex¬ 
periences suggest that multimodel pro¬ 
gramming in general, and the Psyche 
mechanisms in particular, can provide 
the needed flexibility. ■ 
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Adapt is an 
architecture- 
independent language 
based on the split-and- 
merge model. It is 
specialized for 
efficient, parallel 
computation of image- 
processing algorithms. 


I mage processing and low-level computer vision have several charac¬ 
teristics that make them suitable for implementation with parallel com¬ 
puters: 

• The operations are regular. The same operation is performed repeatedly across 
an image. This makes load-balancing easy. 

• The programs follow a stereotypical form. They scan the image in raster order 
and produce results by incrementally modifying a data structure. This means that 
a good parallelizing technique is likely to apply to many algorithms. 

• The algorithms are not “deep. ” They are not complex algorithms that can be 
implemented only with great difficulty. This means that reprogramming is relative¬ 
ly simple, making it possible to take advantage of new architectures given suitable 
programming tools. 

• There is no “dusty deck” syndrome. While code libraries exist, reprogramming 
is common. Compiling and executing huge Fortran programs written decades ago 
is not required, as it is in scientific computing. This means that it is not necessary 
to automatically parallelize existing programs. 

• The algorithms are stable. The software development required for parallel 
processing need not occur in an environment of constantly changing algorithms. 

• Images provide a natural source of parallelism. They constitute large datasets 
and parallel algorithms can often be created by partitioning the image into 
sections. 

• Many applications need the speedups that parallel processing offers. In ro¬ 
botics tasks, image processing is often a critical bottleneck that affects system 
speed. In remote-sensing applications, the enormous size of images and their 
quantity make processing time a limiting factor in what can be learned. Only 
parallel computers can achieve the necessary speedups while still offering the 
required programmability, as has been demonstrated by the many parallel com¬ 
puters developed for image processing. 
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These characteristics make image pro¬ 
cessing and low-level computer vision 
ripe for parallel processing. However, 
programming parallel computers now 
is like programming serial computers 
was in the 1960s. Each parallel comput¬ 
er has its own languages now, just as 
each serial computer then had its own 
assembly language, which would run 
efficiently on one computer only. Then 
as now, users get locked into a particu¬ 
lar manufacturer’s computers, because 
they can not move their programs to 
another computer without enormous 
difficulty. 

There are currently high-level pro¬ 
gramming tools for parallel computers, 
just as there were high-level languages 
then for serial computers; but these tools 
tend to be idiosyncratic to particular 
machines. More important, using them 
involves sacrifices in efficiency. In the 
past, programmers for serial computers 
still wrote their inner loop bodies in the 
assembly language for particular com¬ 
puters, trading portability for speed. 
Similarly, they use low-level tools today 
to get the best performance for parallel 
computers. As a result, users cannot 
share code libraries, which greatly in¬ 
creases programming costs. 

At Carnegie Mellon’s Adapt project, 


we expect the future of parallel comput¬ 
ing to follow the pattern set by serial 
computing. For example, serial com¬ 
puting developed higher level languag¬ 
es that hid the underlying machine 
architecture and implemented program¬ 
ming models for different applications 
areas: Fortran for scientific computing 
and Cobol for business applications. We 
expect the development of such higher 
level languages for parallel computers. 
Serial computing’s compiler technolo¬ 
gy improved with time, until the best 
compilers could compete with the best 
human programmers. High-quality as¬ 
sembly language programming was no 
longer a valuable skill. The same will 
happen with parallel computers. Even¬ 
tually, advances in compiler technology 
fed back into processor design and led 
to improved processor architectures such 
as reduced instruction-set computing 
(RISC). Similarly, as we understand 
parallelism better, we will be able to 
systematically identify useful and use¬ 
less features, leading to improved par¬ 
allel computer design. 

The higher level languages that were 
developed for serial computers incor¬ 
porated a general programming model 
and could be applied to any problem, 
although they were specialized for par¬ 


ticular areas. For example, it was possi¬ 
ble — though inconvenient — to use 
Cobol for scientific computing. Although 
such flexible programming is not yet 
possible for parallel computers, some¬ 
thing almost as good is: specialized, ar¬ 
chitecture-independent languages cre¬ 
ated for different applications areas. 

Because image processing is such an 
important application area for parallel 
computers, it makes sense to provide 
such languages for this area. Apply and 
Adapt are both languages that make it 
possible to write certain types of image- 
processing operations, while providing 
the highest possible level of architec¬ 
ture independence. (See the sidebar dis¬ 
cussion of the levels of architecture in¬ 
dependence that such languages can 
support.) 

Apply is a simple, architecture-inde¬ 
pendent language for local image-pro¬ 
cessing operations. Local operations 
produce an output pixel based on a small 
window surrounding the corresponding 
input pixel; they include edge detection 
and smoothing. (See the sidebar on Ap¬ 
ply for more information.) 

Adapt allows the definition of local 
operations as well as global operations, 
in which an output pixel can depend on 
many or all input pixels. Histogram and 


Levels of architecture independence 


A parallel programming language can provide different lev¬ 
els of architecture independence, depending on how much of 
the underlying architecture is hidden from the programmer: 

• Processor architecture. This is the level of architecture 
independence provided by conventional serial processor lan¬ 
guages. For example, the number of registers and the pres¬ 
ence or absence of a math coprocessor are hidden. 

• Array size. This level hides the size of the processor ar¬ 
ray. The array topology and intercommunication scheme 
must be the same. The data parallel languages for the Con¬ 
nection Machine’ provide this level of architecture indepen¬ 
dence. It is extremely useful, allowing the code development 
on a small array of nodes and production runs on large ar¬ 
rays, as well as upgrades of processor arrays to larger sys¬ 
tems without changing software. However, it does not allow 
the movement of code from one architecture to another. 

• Array topology. This level supports changes in the shape 
of the underlying processor array as well as its size. Pro¬ 
cesses can be placed on whatever processors are conve¬ 
nient, and the programming system takes care of managing 
communication using whatever array topology is available. 

• Interprocessor communication. This level completely 
hides interprocessor communication. For example, Linda 2 


manages interprocessor communication by supporting a “tu¬ 
ple” model in which processes place tuples in a common da¬ 
tabase and read them from there. The Uniform System 3 also 
provides this level of architecture independence. 

• Process management. This is the highest level of archi¬ 
tecture independence. The existence of multiple processes 
— as well as all communication — is hidden from the pro¬ 
grammer. The program uses a model that does not make any 
mention of multiple processes, and the compiler takes care 
of splitting the work into multiple processes and allocating 
data. This potentially allows SIMD, MIMD, and serial archi¬ 
tectures to be programmed efficiently with the same program. 
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other feature-extraction operations are 
global operations. Adapt is based on 
the split-and-merge programming 
model. 

Split-and-merge 
programming model 

There are many different program¬ 
ming models for parallel computers. 
Some models divide the work by assign¬ 
ing each processor a different stage in a 
function. Other models assign each pro¬ 


cessor a different subset of the data. 
Within the latter class of models, called 
data-partitioned models, the work can 
be divided in different ways. One way is 
to provide an evaluation function to 
decide where to put data. In the split- 
and-merge model, the data is split blind¬ 
ly, according to data position, and the 
separately computed adjacent results 
are then merged. 

Two important questions must be 
answered for any parallel programming 
model: What class of computations can 
be done with it and what kinds of paral¬ 
lel computers are suited for its use? 


With the split-and-merge model, we can 
perform any operation that can be com¬ 
puted in forward or reverse order over 
a data structure. Thus, the model is very 
general, but it cannot be applied when 
the serial computation includes a de¬ 
pendency that “matters.” If the compu¬ 
tation has such a dependency, it is not 
possible to compute the same result while 
taking the data in reverse order. Two 
examples of such a dependency from 
image processing are error-diffusion 
halftoning, which propagates an error 
through an image in a particular order, 
and some region-segmentation algo- 


The Apply language 


Apply 1 was targeted to local, low-level vision operations such 
as edge detection, smoothing, and point operations. The lan¬ 
guage is based on the observation that such programs follow a 
stereotypical form: 

for each row 

for each column 

produce an output pixel based on a small window 
of pixels around the current row and column in the 
input image. 

Apply exploits this idea by requiring the programmer to write 
only the innermost “per pixel” portion of this computation. The 
iteration is then implicit and can easily be made parallel. 

For example, Figure A shows an Apply program for the Sobel 
operator. The procedure header defines the input and output 
parameters. These include one input image, from which a 3 x 3 
window is taken, and one output image. (Both are byte imag¬ 
es.) There is also a parameter, const, that determines the type 
of Sobel operator being computed. The body of the procedure 
shows how to compute the Sobel operator. The indexing of im- 
agein is relative to the current row and column being pro¬ 
cessed. 


Apply has been implemented on computers ranging 
from the 10-cell MIMD Warp machine 2 to the 64K-proces- 
sor SIMD content addressable array parallel processor 
(CAAPP) at the University of Massachusetts. 3 Apply’s sim¬ 
plicity and ease of implementation result from its restricted 
programming model and its phrasing in terms of elemental 
data parallelism. The programmer describes the program 
in terms of the smallest meaningful unit — namely, a win¬ 
dow taken around a pixel in an image. Because the pro¬ 
gram is specified in this way, the compiler needs only to 
divide the images among processors and then iterate the 
Apply program over the image sections allocated to each 
processor to get an efficient parallel image-processing 
program. The target-language compiler can perform opti¬ 
mizations. 

Other data-parallel languages specify the program in 
terms of primitive operations on large data objects. On im¬ 
ages, for example, the following operations might be 
specified: “shift image to the left” and “add images." To 
generate an efficient program in such languages on MIMD 
machines, the compiler must divide the data objects 
among processors and combine several primitive opera¬ 
tions, performing optimizations as needed. 


procedure egsbl (imagein: in array (-1..1, -1..1) of byte border 0, 1 
type: const integer, 2 

imageout : out byte) 3 

is 4 

horz, vert: integer; 5 

begin 6 

horz := imagein (-1,-1) + 2*imagein (-1, 0) + imagein (-1,1) - 

imagein (1,-1) - 2*imagein (1, 0) - imagein (1,1); 7 

vert := imagein (-1,-1) + 2*imagein ( 0,-1) + imagein (1,-1) - 

imagein (-1,1) - 2*imagein ( 0,1) - imagein (1,1); 8 

if type = 1 then imageout := sqrt (horz*horz + vert*vert); 9 

else imageout := abs (horz) + abs (vert); 10 

end if; 11 

end egsbl; 12 
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Figure A. Apply Sobel operator. 
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rithms that merge regions while scan¬ 
ning the image in a particular order. 

The split-and-merge model works on 
any reversible computation (as shown 
in the sidebar), but when implemented 
on a distributed-memory parallel com¬ 
puter, it requires transfer of global data 
between processors and storage within 
each processor’s memory for variables 
to be merged. Thus, the split-and-merge 
model is best suited for operations in 
which the size of global data to be pro¬ 
duced is small. These include feature- 
extraction operations, such as histogram 
and connected-component algorithms. 
Other algorithms, such as Hough trans¬ 


form, can be programmed using this 
model, but they will not perform well on 
distributed-memory parallel computers. 

The split-and-merge model runs more 
efficiently on some machines than oth¬ 
ers. The following sections address the 
characteristics of parallel computers that 
affect its performance. 

Processor number. To consider what 
size processor arrays can implement the 
split-and-merge model, we will first de¬ 
velop formulas for execution time un¬ 
der it. 

In the split-and-merge model, we take 
an existing serial computation function 


Split-and-merge computations 


When a data structure is processed in the split-and-merge model, the compu¬ 
tation is split over the structure. Without loss of generality, this data structure is a 
linear sequence [a,, .... a„]. (Other data structures — for example, an image — 
can be mapped onto such a sequence by processing them in a particular order 
— for example, raster order.) This sequence is divided into sections 

[a,.a,,], [a,- i+1 .aj,..., [a,^,. a„] 

Each section is processed independently and the partial results are then com¬ 
bined. 

We will now show that the class of computations that can be done with such a 
model is exactly the class of computations that can be done in forward or reverse 
order over [ai.a„]. 

First, any computation that can be divided in this way can be done in forward 
or reverse order, because it can be divided into sequences of length 1 and each 
length-1 sequence can be processed independently. These sequences can then 
be combined either from the beginning to the end, or from the end to the begin¬ 
ning. 

If the sequence can be processed in either forward or reverse order, then it 
can be processed with the split-and-merge model. To show this, let the serial 
function that processes the data structure in order one element at a time be 

called A. Write A [a,.aj for A(a r A(A H , .... A(a))) — that is, the serial function 

applied to all the elements in the sequence, starting at the first and working for¬ 
ward to the last. Write A R for the reversed version of the serial function. 

Call the function that combines the independently computed results from two 
subsequences C. The point of this part of the proof is to define C so that C(A [a„ 

..., a y ], A [a^,.aj) = A [a f , ..., aj for any two subsequences [a,.a ; ] and [a M , 

.... aj. We do this by “guessing” a sequence [to,,.... toj so A [to,, .... to,] = A [a,. 

a,J. (We must guess this sequence because we do not, at this point, have the ac¬ 
tual values of the [a,.a ; ] available to us; all we have is A [a„ .... a,]. We can al¬ 

ways guess the sequence by simply enumerating all possible sequences and 

testing which of them gives the same result as [a,. a,]. Therefore, this step is 

computable.) 

We then define 

C(A [a„ a], A [a h , .aj) = A R (b u A R (b 2 . A R (b„ A[A h , .aj))) 

We can now show that C satisfies the requirement above, using the reversibili¬ 
ty of A\ 

A [a,.aj = A(a k ,..., A(a k )) = A(a k . A(a hu A(a, . A(a k )))) 

= A(a k . A(a h „ A(b„ ..., A(b,)))) = 4 n (to,. A R (b„ A R (a *,. A R (a k )))) 

= A R (b, . A R (b„ A(a k . A(a,)))) = A R (b u ..., A R (b„ A[a h . a k ])) 

= C(A [a, .aj, A [a h , .a*]) 



\ 
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Figure 1. Execution time with the 
split-and-merge method. 


A and augment it with a combining func¬ 
tion C that allows results to be comput¬ 
ed independently and then combined. 
Any loss of efficiency in the parallel 
computation comes from the function C 
(and from data transmission between 
processors, which we can include in C). 

The efficiency of the computation 
depends on 

(1) the number of processors P. 

(2) the method of combining results 
produced by different processors; 
for example, serially or in a bina¬ 
ry tree. 

(3) the time needed for combining 
two independently produced re¬ 
sults as compared with the time 
needed to produce them. 

We write N for the number of data to be 
processed. To simplify the discussion, 
we assume that the time needed to pro¬ 
cess a single datum with A is constant 
(call it a). Thus, the time a processor 
needs to process its data is a(NIP). Af¬ 
ter each processor processes its data, 
the partial results must be combined. 
We assume the time for this (including 
communication) is also constant, and 
call it c. 

On a serial computer, the total execu¬ 
tion time is T, = aN. On a parallel com¬ 
puter where results are combined se¬ 
quentially, the execution time is T, = 
a(NIP) + cP. If we combine results in a 
binary tree, the execution time is T b = 
a(NIP) + c log P. If we do the combining 
operations as we might on a two-dimen¬ 
sional mesh — first combining all rows 
into one column and then combining 
that column, the execution time is T 2 = 
a(NIP) + 2<rJP. 

Figure 1 gives typical execution time 
behavior. Initially, execution time de¬ 
creases linearly as we add more proces¬ 
sors. The execution time curve passes 
through a “knee” where the decrease in 
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Table 1. Optimal processors for the split-and-merge model. 


Algorithm 

ale 

Image Size 

a 

Serial 

Combining Method: 
Two-Dimensional 

Binary Tree 

Image sum 

1 

512 x 512 

260,000 

300 

160 

16,000 



10K x 10K 

100,000,000 

5,800 

85,000 

4,000,000 

Histogram 

0.004 

512x512 

1,000 

18 

40 

106 



10K x 10K 

390,000 

360 

2,100 

22,000 

Connected 

0.001 

512 x 512 

260 

9 

16 

32 

components 


10K x 10K 

10,000 

41 

120 

430 


execution time starts to taper off as the 
overhead of more and more combining 
operations builds up. Eventually, the 
combining overhead balances the speed¬ 
up from using more processors, and the 
curve reaches a minimum. Thereafter, 
adding more processors actually increas¬ 
es execution time. 

Clearly it is not cost-effective to use 
so many processors that the decrease in 
execution time as we add another pro¬ 
cessor is near zero. Instead, we would 
rather be near the knee of the curve. 
The precise location of this point can be 
found. 1 It depends only on how partial 
results are combined and on the num¬ 
ber a = ( aN)lc . For sequential combin¬ 
ing, the knee occurs at (a/3) 1/2 . With 
combining as on a two-dimensional 
mesh, the knee occurs at (a/4) 2 ' 3 . With 
binary tree combining, the knee occurs 
at the P that satisfies P = (a log 2) / (log 
P + 2). 

In image processing, the number a 
depends on the number of pixels in the 
image, which ranges from tens of thou¬ 
sands in industrial vision applications 
to hundreds of millions in reconnais¬ 
sance applications, and on the ratio of 
execution time of the A and C func¬ 
tions, ale. This ratio varies, but is usual¬ 
ly quite small: less than 0.01. For exam¬ 
ple, in histogram algorithms, the A 
function increments a histogram ele¬ 
ment, while the C function must add 
two histograms together, so this ratio is 
0.004 or less. In the connected-compo¬ 
nents algorithm discussed later, the ra¬ 
tio is less than 0.001. 

Table 1 gives the optimal processor 
numbers for two image sizes and three 
algorithms under three different com¬ 
bining methods. We see from this table 
that the number of processors that can 
be effectively used under the split-and- 
merge model is on the order of hun¬ 
dreds, not thousands, except with enor¬ 
mous images and algorithms in which 


the combining function is trivial or non¬ 
existent (as in local operations like the 
addition of two images). 

Memory size. The split-and-merge 
model combines the results from differ¬ 
ent processors into a single final result. 
As implemented in Adapt, there is no 
ordering restriction on the accesses to 
the data structures being combined. 
Therefore, for efficient implementation, 
these data structures must be placed in 
a memory that can be efficiently access¬ 
ed by a processor, such as the proces¬ 
sor’s local memory or a common shared 
memory. As a result, in the distributed- 
memory case, the memory at each pro¬ 
cessor must be large enough to contain 
this final result. This means that split- 
and-merge processor arrays must have 
relatively large memories at each pro¬ 
cessor or must share access to a relative¬ 
ly large common memory. 

Control. In the split-and-merge mod¬ 
el, computations are done independently 
at each processor, and then these re¬ 
sults are combined. This creates oppor¬ 
tunities for the exploitation of either 
single instruction, multiple data (SIMD) 
or multiple instruction, multiple data 
(MIMD) control. With SIMD, control 
for the independent computations in 
the first phase can be factored out, with 
some decrease in hardware cost. With 
MIMD, each processor must maintain 
its own instruction memory and con¬ 
troller, but it can take advantage of 
independent branching in conditional 
operations and reduce overall compu¬ 
tation time correspondingly. 

Interprocessor bandwidth. Interpro¬ 
cessor bandwidth is critical in the split- 
and-merge model. Table 1 indicates a 
severe limitation in the number of pro¬ 
cessors that can be effectively exploit¬ 
ed, even assuming favorable values such 


as 0.001 for ale. With low interprocessor 
bandwidth, this ratio increases rapidly, 
because results must be exchanged be¬ 
tween processors during the combining 
phase. Because the combining phase is 
so critical for use of more than a few 
processors, interprocessor bandwidth 
must be very high. 

Interprocessor latency. This is not a 
critical issue for the split-and-merge 
model. The input data can be distribut¬ 
ed to all processors, local computation 
can be done independently, and then 
the combining phase can be done with 
its exchange of variables between pro¬ 
cessors. Nothing in this implies that one 
processor must wait repeatedly for an¬ 
other to complete before it can do its 
computation. 

Array topology. There are at least 
three different ways of implementing 
the combining operation according to 
the underlying array topology: 

• serial combining on a one-dimen¬ 
sional processor, 

• two-level merging on a two-dimen¬ 
sional processor, and 

• binary tree merging on a processor 
with hypercube or similar high-di- 
mension interconnections. 

Low-dimension interconnections can be 
made with high bandwidth and low la¬ 
tency, making it possible to use more 
processors because of a faster combin¬ 
ing function. But high-dimension inter¬ 
connections can implement a more effi¬ 
cient combining structure, again making 
it possible to use more processors. 

The Adapt language 

Adapt is an implementation of the 
split-and-merge model for image pro- 
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cessing. Adapt is similar to 
Apply, except that in Adapt 
the programmer describes both 
the operation to be performed 
at every pixel of the image (the 
body of the program in Apply, 
called the Next section in 
Adapt), as well as a combining 
operation to combine two re¬ 
sults produced independently 
at different processors (called 
the Combine section). The pro¬ 
grammer may also describe an 
operation to be performed 
before any other processing is 
done (called the First section) 
and an operation to be done 
after the completion of all pro¬ 
cessing (called the Last sec¬ 
tion). The sidebar lists and 
describes these four sections 
of an Adapt program. 

As demonstrated earlier in 
the discussion of processor 
number, the split-and-merge 
model does not make enough 
parallelism available to use 
more processors than there are 
rows in an image. Therefore, 
Adapt is oriented towards par¬ 
titioning the image by rows: 
First can be run only at the 
beginning of a row, and Next is 
allowed to reuse previous re¬ 
sults. This makes Adapt pro¬ 
grams easier to write and po¬ 
tentially more efficient. 


procedure histogram ( 
im : in image byte, 1 

hist: out array (0..255) of float) 2 
is 3 

count: array (0..255) of integer; 4 

first begin 5 

for i in 0..255 loop 6 

count (i) := 0; 7 

end loop; 8 

end first; 9 

next begin 10 

count (im) := count (im) + 1; 11 

end next; 12 

combine begin 13 

for i in 0..255 loop 14 

count (i) := count (i)+_count (i); 15 

end loop; 16 

end combine; 17 

last 18 

pixels: integer; 19 

begin 20 

pixels := 0; 21 

for i in 0..255 loop 22 

pixels := pixels + count (i); 23 

end loop; 24 

for i in 0..255 loop 25 

hist (i) := float (count (i))/pixels; 26 
end loop; 27 

end last; 28 

end histogram; 29 


Figure 2. Adapt histogram. 


Sections of an Adapt program 

An Adapt program has four sections: 

First An initialization function that can be run at the beginning of any row of 
the image. 

Next A section applied in raster order across the image (wrapped around at 
the borders of the image). Each execution of Next is preceded either by another 
execution of Next (for the previous pixel) or by First (at the beginning of a row). 
As in Apply, images processed in the Next section are referenced relative to the 
current pixel position. 

Combine: A merging function that combines the outputs of any two image re¬ 
gions to produce an output for the concatenation of the two regions. To make 
programming easier, Combine is applied only to adjacent groups of consecutive 
rows of the image. Combine can reference any variable used elsewhere in the 
program, because all variables retain their values. Its output is also expressed in 
terms of these variables. Within the Combine section, variables preceded by an 
underscore refer to the values of variables in the lower image region, and vari¬ 
ables not preceded by an underscore refer to values from the upper image re¬ 
gion. The result of the Combine section assigns the correct values for the 
merged region to variables not preceded by an underscore. 

Last A termination function applied once after the output of the entire image is 
computed. 


There are four types of parameters in 
Adapt programs: input and output im¬ 
ages and other input and output param¬ 
eters called global parameters. Images 
are two-dimensional arrays that the Next 
section iterates over. Input global pa¬ 
rameters are available throughout the 
Adapt program; their values cannot be 
changed. Output global parameters and 
other variables are given values as a 
result of processing in the First, Next, 
and Last sections. The Combine section 
combines the values of these variables 
from two different regions of the image. 

Figure 2 shows an Adapt program for 
the histogram algorithm to illustrate these 
concepts. The procedure header (lines 1 
through 2) declares one input image and 
one output global variable, the histo¬ 
gram. The final histogram resulting from 
this program gives the frequency of each 
pixel value, so the histogram is declared 
a floating-point value. Line 4 declares 
an internal global program variable, 
count; it is available to all sections of the 
program. 

The First section (lines 5 through 9) 
zeroes an integer histogram, called count, 
that the program uses internally to avoid 
floating-point computation. The Next 
section (lines 10 through 12) increments 
the count element for the current pixel. 
The Combine section (lines 13 through 
17) adds the count arrays from the upper 
and lower image regions and stores the 
result in the histogram for the merged 
region. Note the presence of the under¬ 
score (line 15) before the version of 
count that comes from the lower region. 
Finally, the Last section (lines 18 through 
28) adds all the histogram elements to¬ 
gether to get the total number of pixels 
in the image and divides the frequency- 
count table by this total to get the final 
frequency count. 

The exact semantics of Adapt can be 
understood in terms of invariants ap¬ 
plied over the input and output images. 
The First section establishes the invari¬ 
ant for the null region starting and end¬ 
ing at the first pixel of the current row. 
Given that the invariant is true from 
some starting row up until the current 
pixel, the Next section extends the in¬ 
variant by one pixel in raster order. The 
Combine section takes two versions of 
the global program variables from two 
adjacent regions of rows and makes the 
invariant true for the two regions merged. 
Given that the invariant is true for the 
entire image, the Last section establish¬ 
es the final result of the program. 
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We can illustrate these se¬ 
mantics by once again exam¬ 
ining the histogram program. 
The invariant is 

count(jc) is the number 

of pixels having value x. 

To establish this for a null 
region, count(x) must be set 
to 0 for all x\ the First section 
does this. The Next section 
extends the invariant by in¬ 
crementing the appropriate 
element of count. The Com¬ 
bine section must add the two 
versions of count together to 
keep the invariant true, and 
Last uses count to compute 
the final result of the pro¬ 
gram, the pixel-frequency dis¬ 
tribution. 

The following sections il¬ 
lustrate the features of the 
Adapt language and its use 
by presenting some example 
Adapt programs. 

Fast uniform convolution. 

A fast algorithm exists for M 
x N uniform convolution (that 
is, convolution with a rectan¬ 
gular filter with constant val¬ 
ues). The algorithm maintains 
an array of sums of the last M 
elements of each column. The 
sum of the last N of these 
column sums is also main¬ 
tained. This is the “box sum,” 
which becomes the convolu¬ 
tion result when multiplied 
by a constant. The algorithm 
scans the image in raster or¬ 
der and processes the next 
pixel as shown in Figure 3. 
First, the column sum for the 
next column is updated by 
subtracting the image pixel 
from the row M previous and 
adding the current image pix¬ 
el. Next, the box sum is up¬ 
dated by subtracting the col¬ 
umn sum from the column N 
to the left and adding the cur¬ 
rent column sum. Thus, this 
algorithm calculates the uni¬ 
form convolution in only four 
addition operations, instead 
of the MN operations need¬ 
ed if the box sum were calcu¬ 
lated independently at each 
pixel. 



If we use the array colsums 
to store the column sums and 
boxsum to store the box sum, 
and if im is the image, then a 
natural invariant for uniform 
convolution is: 

For region [(a,0), (P,y)]: 
l.c<y + /V/2=> 

^colsums(c) = X ' rn ( / "’ c )j 


procedure uconv ( 
inimg : in image 
array (-M/2-1..M/2, -N/2..N/2) 
of byte border mirrored. 30 

weight: in float. 31 

outimg : out image byte) 32 

is 33 

colsums : array (-N/2..Cols+N/2) of integer; 34 
boxsum: float; 35 

startrow : integer; 36 

first begin 37 

startrow := row; 38 

boxsum := 0; 39 

for c in -N/2..N/2 loop 40 

colsums (c) := 0; 41 

for r in -M/2..M/2 loop 42 

colsums (c) := colsums (c) + inimg (r,c); 43 

end loop; 44 

boxsum := boxsum + colsums (c); 45 

end loop; 46 

end first; 47 

next begin 48 

if startrow = row then 49 

colsum (col+N/2) := 0; 50 

for i in -M/2..M/2 loop 51 

colsums (col+N/2) := colsums (col+N/2) + 
inimg (i,N/2); 52 

end loop; 53 

else 54 

colsums (col+N/2) ;= colsums (col+N/2) - 
inimg (-M/2-1, N/2) + inimg (M/2,N/ 2); 55 

end if; 56 

if col = 0 and row > startrow then 57 

boxsum := 0; 58 

for c in -N/2..N/2 loop 59 

boxsum := boxsum + colsums (c); 60 

end loop; 61 

else 62 

boxsum := boxsum - colsums (col-N/2-1) + 
colsums (col+N/2); 63 

end if; 64 

outimg := boxsum * weight; 65 

end next; 66 

end uconv; 67 


Figure 4. Adapt uniform convolution. 


2. c> y + A/2 =* 

|coIsums(c) = X im(r,c)j 

3. boxsum = ^ colsums (c) 

The problem with this invari¬ 
ant is that we cannot estab¬ 
lish it efficiently. The First 
section is the natural place to 
establish parts 1 and 2, but we 
cannot access the entire im¬ 
age in this section. Only a 
window around the first pixel 
in the row can be read, so part 
2 cannot be established. 
Hence, we must instead es¬ 
tablish the invariant in the 
Next section, which leads to 
MN add operations per pixel. 

However, we can overcome 
this problem by modifying the 
invariant and establishing part 
2 while the Next section scans 
the first row. We simply es¬ 
tablish parts 1 and 3 of the 
invariant in the First section, 
and note which row we are 
processing. Part 2 of the in¬ 
variant is established as we 
process this row. 

The invariant then be¬ 
comes: 


1. c<i + NI2=> 

[ colsums (c)= X im ( r > c ) I 
1, HJ-m/2 ) 

2. startrow = a 

3. p* startrow a i > y =» 

colsums (c) = X im(r,c)j 

4. boxsum = X colsums ( c ) 


Figure 4 shows the Adapt 
program for uniform convo- 
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lution. Lines 38 through 46 
establish all four parts of the 
invariant for the first column 
in a region. Lines 50 through 
53 establish part 1 for col¬ 
umns in the first row, while 
line 55 establishes this step 
for other rows. Lines 58 
through 61 establish part 3 
for the first column in a row 
(unless already established by 
First.) The update to boxsum 
for the next pixel is in line 63. 
The total number of adds per 
pixel is only four, with IV ex¬ 
tra adds required for each row 
and MN extra adds per im¬ 
age. On a parallel computer 
where each processor takes 
one swath of PRows rows of 
the image, the execution time 
in add operations is MN + 
PRows (1V + 4 x Cols). 

Adapt run-length encoding. 

Run-length encoding is a sim¬ 
ple image compression oper¬ 
ation. An image is scanned in 
raster order and each run of 
identically valued pixels of 
length n and value g is changed 
into an ordered pair (g, n -1). 
(The length is represented as 
n -1 instead of n because any 
run is of length at least 1, so 
n - 1 allows the representa¬ 
tion of runs one pixel longer 
than would otherwise be the 

An invariant for run-length 
encoding is 

After processing pixel k 
in raster order, the run- 
length encoded 
representation is {(g„ n,), 

C ?i . « 2 ), (g m , n m ) 1 if, 

and only if, the image is 

g"' + ‘g" 2+1 -gm +t ’ 


where a b represents a sequence of 
b a' s, and g, * g M for any i. 

A problem with this invariant is that the 
run-length encoded representation it 
produces is difficult to invert in parallel. 
To determine a pixel’s value in the run- 
length encoded representation requires 
stepping through all the data in the rep¬ 
resentation up to that point. This makes 
it impossible to do run-length decoding 


Figure 5. Run-length encoding. 


procedure rle ( 

im : in image byte, 

g : out array (0..MAXSIZE-1) of byte, 
n : out array (0..MAXSIZE-1) of byte, 
Start: out array (O..Rows-l) of integer, 
m : out integer) is 

1 

NRows: integer; 

2 

first begin 

3 

m := -1; 

4 

NRows := 0; 

5 

end first; 

6 

next begin 

7 

if col = 0 or else im /= g (m) or 

8 

n (m) = 255 then 

9 

m := m + 1; 

10 

g (m) ;= im; 

11 

n (m) := 0; 

12 

if col = 0 then 

13 

Start (NRows) := m; 

14 

NRows ;= NRows + 1; 

15 

end if; 

16 

else 

17 

n (m) := n (m) + 1; 

18 

end if; 

19 

end next; 

20 

combine 

21 

offset: integer; 

22 

begin 

23 

if NRows > 0 then 

24 

offset := m+1; 

25 

else 

26 

offset ;= 0; 

27 

end if; 

28 

for i in l.._NRows loop 

29 

Start (NRows) := 

30 

_Start (i-1) + offset; 

31 

NRows := NRows + 1; 

32 

end loop; 

33 

for i in 0.._m loop 

34 

n (m+i+1) :=_n (i); 

35 

g (m+i+1) ;= _g (i); 

36 

end loop; 

37 

m := m + _m + 1; 

38 

end combine; 

39 

end rle; 

40 


in parallel. However, a small change to 
the invariant allows this. 

All we have to do is maintain a table 
of the starting positions of each row of 
data in the original image. To do this, 
we must split runs of data across image 
boundaries. Doing so is unlikely to de¬ 
crease the compression by much, since 
pixels at opposite borders of the image 
are probably unrelated. We will main¬ 
tain an array, called Start, that indicates 


the position in the run-length 
encoding where each row 
begins. To define such an ar¬ 
ray, runs must be split across 
image borders. An invariant 
for such a program is the in¬ 
variant above combined with 

The ith row in the region 
starts at position 

"l + «2 + - + «S.art(i)-l- 

Figure 5 shows an Adapt pro¬ 
gram for this invariant. 


Adapt 

implementations 

On a serial machine, an 
Adapt program can be im¬ 
plemented as shown in Fig¬ 
ure 6. In this implementation, 
the First section is executed 
once, at the beginning of the 
image. Then the Next section 
executed across the image 
in raster order. Finally, the 
Last section is executed. The 
Combine section is not exe¬ 
cuted at all. 

On a parallel machine, the 
input image can be divided 
either by rows or by columns. 
Figure 7 shows one possible 
implementation if the image 
is divded by rows. Here each 
processor takes a horizontal 
swath of the image and exe¬ 
cutes the First section at the 
beginning of the swath. The 
Next section is executed by 
each processor across its 
swath. Then Combine func¬ 
tions are used to combine the 
results computed indepen¬ 
dently by the processors. Fi¬ 
nally, the Last section is exe¬ 
cuted. 

Figure 8 shows a possible 
implementation if the image is parti¬ 
tioned by columns. For each row, a pro¬ 
cessor executes the First section, then 
runs Next functions across its portion of 
the image. Then it transfers its interme¬ 
diate variables to another processor. 
This second processor runs Next func¬ 
tions on its portion of the image, then 
transfers its intermediate results to a 
third processor. The last processor in 
this chain executes Next functions on its 
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Figure 6. Adapt serial implementa- 
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Figure 7. Row-partitioned implemen¬ 
tation of Adapt. 
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Figure 8. Column-partitioned implementation of Adapt. 



processor array. 


portion of the image and 
then executes a Combine 
section to combine the re¬ 
sults just computed with the 
results for the previous por¬ 
tion of the image. After the 
entire image is processed, 
the Last section is execut¬ 
ed. 

The column-partitioned 
implementation is usually 
less efficient than the row- 
partitioned implementa¬ 
tion. Intermediate results 
must be passed between 
processors, and each pro¬ 
cessor must wait for the 
previous processor to com¬ 
plete before it can begin. 

(The processors may still 
be working all at the same 
time in a pipeline.) The tight 
coupling between proces¬ 
sors and the need for a lot 
of interprocessor commu¬ 
nication may reduce per¬ 
formance on many parallel 
computers. However, the 
column-partitioned imple¬ 
mentation has two advan¬ 
tages. First, it stores a small¬ 
er amount of input image 
at each processor, so it can 
be used when the proces¬ 
sors have small memories. 

Second, it runs the Com¬ 
bine section in parallel with 
the Next section, whereas 
the row-partitioned imple¬ 
mentation executes all the 
Combine functions after all 
the Next functions. 

Figure 9 shows how a two- 
dimensional processor 
mesh increases efficiency by 
taking advantage of the 
flexibility in executing 
Combine sections. The im¬ 
age is still partitioned by 
rows, but now the Combine 
section is first executed hor¬ 
izontally across the processor array in 
parallel. Then it is executed vertically. 
This method increases the number of 
processors that can be applied to an 
image processing problem because it 
reduces the overhead for the Combine 
step. 

In Figure 7 and Figure 9, the Com¬ 
bine operations can be done serially, as 
illustrated, or in a binary tree. The bina¬ 
ry tree is more efficient, as discussed in 


the section on processor number, but 
requires communication between non- 
adjacent processors. 

Performance 

Adapt has been implemented on 

• Unix serial architectures (Sun 4/330 
systems), using the serial imple¬ 


mentation method; 

•the Carnegie Mellon 
Warp machine 2 in two 
separate implementa¬ 
tions, one using row par¬ 
titioning and the other 
using column partition¬ 
ing; 

•the Carnegie Mellon 
Nectar computer, using 
row partitioning in a 
task-queue model (the 
image is partitioned into 
sections of rows, and 
these row sections are 
allocated to processors 
upon their request, pro¬ 
viding automatic load 
balancing); and 
• the Carnegie Mellon In¬ 
tel Corporation iWarp 
computer, using binary 
tree combining. 

Results from the Sun and 
Warp implementations are 
presented here. (The Nec¬ 
tar implementations are 
currently inefficient, as the 
prototype Nectar system 
requires transferring im¬ 
ages twice across a VME- 
bus, completely offsetting 
any performance improve¬ 
ment from the use of mul¬ 
tiple processors. Results 
from the iWarp implemen¬ 
tation are still prelimi¬ 
nary.) 

Tables 2 and 3 on the 
next page give the execu¬ 
tion times of Adapt pro¬ 
grams on Warp and the 
Sun 4/330 and, where they 
exist, the execution times 
of equivalent Warp hand- 
coded programs. In most 
cases, the Warp hand- 
coded programs were 
carefully optimized for 
execution time. 

This data can be used in several 
ways: 

• to evaluate the performance of 
Adapt, 

• to compare the performance of dis¬ 
similar architectures, and 

• to evaluate different algorithms for 
the same problem. 

In evaluating the performance of 
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Table 2. Execution times, in seconds, of Adapt programs for 512 x 512 image processing. 


Program 

Warp 

Hand-Coded 

Warp Adapt 
Column-Partitioned 

Warp Adapt 
Row-Partitioned 

Sun 4/330 
Adapt 

Histogram 

0.143 

3.430 

0.119 

0.163 

Minimum bounding rectangle 

1.250 

8.460 

0.143 

1.300 

Run-length encoding 


2.910 

0.207 

0.216 

Run-length decoding 


0.271 

0.189 

0.166 

3x3 median filter 

0.779 

1.850 

0.779 

1.830 

Halftoning using Gibbs sampler 
Pattern generation: 


2.500 

0.576 

1.800 

Checkerboard 

0.106 

0.253 

0.178 . 

0.200 

Stripes 

0.151 

0.334 

0.280 

1.300 

Bull’s-eye 

0.140 

0.310 

0.274 

4.370 

Diamonds 

0.926 

0.395 

0.338 

2.530 

Grid 

0.131 

0.234 

0.178 

0.183 


Table 3. Execution times for connected-components algorithms. 


Connected Components 

Warp 

Hand-Coded 

Warp Adapt 
Column-Partitioned 

Warp Adapt 
Row-Partitioned 

Sun 4/330 

Adapt 

Size of regions in image: 



Small 

Large 

Small 

Large 

Small 

Large 

Nearest-neighbor propagation 

Total: 


3.56 

102 

1.92 

59.8 

2.89 

73.6 

Initialize 



0.416 

0.333 

0.232 

0.257 

0.316 

0.333 

Propagate 



3.14 

102 

1.69 

59.5 

2.57 

73.3 

Shrink-expand 

Total: 


6.10 

311 

3.13 

162 

8.56 

434 

Shrink step 



2.92 

149 

1.23 

64.4 

3.28 

169 

Expand step 



3.18 

162 

1.90 

97.3 

5.28 

265 

Union-find 

Total: 

4.51 

62.5 

74.5 

1.30 

1.31 

2.53 

1.91 

Step 1 



6.50 

6.50 

0.347 

0.356 

0.853 

0.769 

Step 2 



55.5 

67.5 

0.526 

0.523 

1.22 

0.783 

Step 3 



0.528 

0.549 

0.428 

0.429 

0.456 

0.354 


Adapt, the Warp hand-coded times were 
compared with the Warp row-parti¬ 
tioned (Adapt) times. Several factors 
influence this comparison: 

• In the Adapt programs, the image 
size is specified at run-time; this is a 
feature of the Adapt compiler. The hand- 
coded Warp programs require specifi¬ 
cation of the image size at compile time. 
Run-time image size specification in¬ 
troduces a small amount of overhead 
and eliminates some opportunities for 
optimization by the W2 compiler. (W2 
is the base language for Warp.) This 
accounts for the better hand-coded per¬ 
formance in most of the pattern-gener¬ 
ation programs. 

• The hand-coded programs were de¬ 
signed under assumptions that became 
invalid with the evolution of the W2 


compiler. As the compiler matured, cer¬ 
tain special-purpose features were elim¬ 
inated, which made the hand-coded pro¬ 
grams less efficient. The Adapt 
compiler’s program mapping is more 
recent and corresponds more exactly to 
the current W2 compiler. This validates 
the Adapt approach since it is much 
easier to keep the Adapt programs up 
to date (that is, by changing only one 
program — the Adapt compiler) than 
the hand-coded programs. “Histogram” 
and “Minimum bounding rectangle” in 
Table 2 show better performance be¬ 
cause of this. 

•In some cases, the mapping tech¬ 
nique used in the row-partitioned Adapt 
is better than the hand-coded programs, 
which generally partition the image in 
columns. Table 3 shows this for the 
“Union-find” connected-components al¬ 


gorithm, which is faster in Adapt than 
in hand-code. 

• In some cases, for example, “Dia¬ 
monds” in Table 2, the Adapt algo¬ 
rithm is faster. 

Overall, the results show that Adapt 
provides increased functionality at com¬ 
parable or better performance than 
hand-code. A comparison of the Warp 
row-partitioned times with the Sun 4/ 
330 times shows that the Sun 4/330 
performs nearly as well as Warp except 
on algorithms that include a lot of float¬ 
ing-point computation. This is not sur¬ 
prising considering that the Sun’s ar¬ 
chitecture is more recent and the Warp’s 
integer capability is relatively poor. 

The results for connected compo¬ 
nents show the general superiority of 
the nearest-neighbor algorithm to the 
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others on images with very small regions (only a few pixels 
each). The union-find algorithm is superior on other images 
because of its near-constant execution time independent of 
the image regions. 

I t is possible to program both local and global 
image processing operations in a single framework by 
basing the framework on a powerful programming model, 
namely, the split-and-merge model. We can apply this model 
to a wide variety of algorithms — specifically, to any algo¬ 
rithm that can be computed in forward or reverse order over 
a data structure, here an image. Adapt implements this 
model efficiently on several different parallel computers. 
Adapt programming techniques are based on the concept of 
an invariant, which applies to the image. As a result, Adapt 
programming is similar to serial programming. Adapt can be 
implemented in a variety of ways, depending on the charac¬ 
teristics of the target architecture. Results from Adapt im¬ 
plementations show that Adapt provides an efficient base for 
architecture-independent programming. ■ 
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Massively Parallel 
Model Matching 

Geometric Hashing 

on the Connection Machine 


Isidore Rigoutsos and Robert Hummel 
New York University 


Geometric hashing 
provides a parallel 
method for model 
matching. However, the 
design and 
implementation of 
parallel algorithms for 
AI applications is 
fraught with subtlety 
and challenge. 


G eneral-purpose parallel computer architectures are now designed with 
computationally intensive applications such as computer vision in mind. 
Nevertheless, using an existing parallel machine to develop a working 
vision system presents numerous problems. Our implementation of an object- 
recognition algorithm on a Thinking Machines Corporation CM-2 Connection 
Machine 1 may typify the choices and difficulties involved. 

Developing realistic vision systems that can recognize rigid objects from a 
database of hundreds of models is a continuing goal of vision researchers. A model- 
based vision system extracts features such as edges and points from digital imagery 
and compares them with a database of models to identify objects within a scene. 
Many model-based vision systems are based on hypothesizing matches between 
scene features and model features, predicting new matches, and verifying or 
changing the hypotheses through a search process. 23 A new method, called 
geometric hashing , 4 offers a different and more parallelizable paradigm (see the 
sidebar on the following page). 

Geometric hashing uses the collection of models in a preprocessing phase 
(executed off line) to build a hash table data structure. The data structure encodes 
the model information in a highly redundant, multiple-viewpoint way. During the 
recognition phase, when presented with a scene and extracted features, the hash 
table data structure indexes geometric properties of the scene features to candi¬ 
date models. The geometric hashing scheme still requires a search over features in 
the scene, but it obviates a search over the models and the model features. Thus, 
its recognition phase offers computational efficiencies, and geometric hashing is 
highly amenable to parallel implementation. 

In this article, we explore the parallelizability of geometric hashing and present 
two algorithms. The first one uses (1) parallel hypercube techniques to route 
information through a series of maps and (2) building-block parallel algorithms. 
The second algorithm uses the Connection Machine’s large memory resources and 
achieves parallelism through broadcast facilities from the front end. 
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We will confine ourselves to the prob¬ 
lem of recognizing dot patterns embed¬ 
ded in a scene after they have under¬ 
gone translation, rotation, and scale 
changes. Each dot can represent a fea¬ 
ture location extracted from an image. 
In a more general vision system, we 
would like to recognize patterns of lines, 
corners, and other features, attached to 
three-dimensional objects, undergoing 
rigid 3D transformations and perspec- 


Geometric hashing for 

Suppose we want to recognize pat¬ 
terns of points that may be translated, 
but for the moment, we assume no ro¬ 
tation, scaling, or other transforma¬ 
tions. The model in Figure A consists 
of five dots. Suppose that we place dot 
1 at the origin of a coordinate system. 
Then the other dots lie at four different 
(x, y) locations. Let’s record in a quan¬ 
tized hash table, in each of the four 
bins where this information lands, the 
fact that model M, with basis point 1 
yields an entry. Figure A shows this 
graphically, viewing only entries of the 


tively projected onto an image plane. 
Conceptually, the geometric hashing 
algorithms will extend to such cases. 
But many implementation issues must 
be addressed, and the transformation 
classes of the features will be more com¬ 
plicated. In particular, the coordinates of 
the points in our study are represented 
by two-dimensional entities. More com¬ 
plex features will involve more dimen¬ 
sions and more complex transformations. 


form (/ty, 1). Similarly, the hash table con¬ 
tains four entries of the form (M,, 2), four 
entries of the form (M„ 3), and so on. 

Each entry is generated by placing the 
base point at the origin of the hash table 
and observing where the other points of 
the model land. The same process is re¬ 
peated for each model. Of course, hash 
bins may receive more than one entry. As 
a result, the final hash table contains a list 
of entries of the form (model, base point) 
in each bin. 

In the recognition phase, a single point 
from the scene is chosen as a candidate 


In our study, we use patterns of 16 points; 
models involving more complex features 
will need to be described by a similarly 
small number of primitives. 

As implemented, the algorithms rec¬ 
ognize models consisting of patterns of 
points embedded in scenes, indepen¬ 
dent of translation, rotation, and scale 
changes. Thousands of models may be 
used, each containing approximately 16 
points, with scenes consisting of hun- 


basis point. The coordinates of all oth¬ 
er points are then calculated with this 
point placed at the origin. Each re¬ 
maining point is mapped to the hash 
table, and all entries in the corre¬ 
sponding bin receive a vote. If there 
are sufficient votes for one or more 
(model, basepoint) combinations, then 
a subsequent stage attempts to verify 
the presence of a model with the des¬ 
ignated point located at the chosen 
basis point, if points are missing from 
the scene because they are obscured, 
recognition is still possible, as long as 


point matching 



Figure A. Model M, consisting of five points and all resulting hash table entries. 
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dreds of points. Most of the scene points 
are spurious noise points, and points 
may be obscured or misplaced. The sys¬ 
tem searches over pairs of scene points 
and obtains recognition as soon as both 
points lie on the embedded object. How¬ 
ever, multiple pairs can be probed at the 
same time, and many heuristics exist for 
choosing likely basis pairs. With 1,024 
models and scenes consisting of 200 
points, execution time on a 64K-proces- 


there is a sufficient number of points 
hashing to the correct bins. The list of en¬ 
tries in each bin may be large, but be¬ 
cause there are many possible models 
and basis sets, the likelihood that a single 
model and single basis set will receive 
multiple votes is quite small, unless a 
configuration of transformed points coin¬ 
cides with a model. In general, we do not 
expect the voting scheme to give only one 
candidate solution (see Lamdan and Wolf- 
son 5 ). The goal of the voting scheme is to 
reduce significantly the number of candi¬ 
dates for the verification step. 

The case where the dot patterns can 
also undergo rotation and translation is 
treated analogously. Two points are now 
needed to define a basis, and point loca¬ 
tions are measured relative to a coordi¬ 
nate system defined by the pair of points. 
When we place the basis midpoint at the 
center of a coordi¬ 
nate system (see 
Figure B), the re¬ 
maining points of 
model M, land in 
three locations. A 
quantized hash ta¬ 
ble will now record, 
in each of the three 
bins where the dots 
land, the fact that 
(in this case) model 
/W, with basis (4,5) 
yields an entry in 
this bin. During rec¬ 
ognition, two points 
from the scene are 
chosen as the ba¬ 
sis. The coordi¬ 
nates of the other 
scene points are 
calculated relative 
to the coordinate 
system defined by 
the basis pair. 


sor CM-2 would be roughly 70 millisec¬ 
onds per probe with a single basis set. 

Geometric hashing 
connectionist algorithm 

The first parallel algorithm is data 
parallel over the hash-table bin entries 
and scene points, but serial over the 


Each of the remaining points is again 
mapped to the hash table; all entries in 
the corresponding bin receive a vote; 
and we continue as before. If the dot 
pattern is also allowed to undergo 
scale changes, we use the length of 
the selected basis as the length of the 
unit vector of the coordinate system. 

For the algorithm to be successful, it 
is sufficient to select as a basis tuple 
any pair of points belonging to some 
model. It is not necessary to hypothe¬ 
size which model nor which model 
points are the corresponding points, 
since all models and basis pairs are 
redundantly stored within the hash ta¬ 
ble. Classification or perceptual group¬ 
ing of features can be used to make 
the search over scene features more 
efficient, for example, by making use 
of only special basis pairs. 


bases. We assume that our database 
contains M models; each model m has 
an associated set, S m = [(x mk , y m *)}C =1 , 
containing the coordinate pairs of the 
mth model’s n points. 

Preprocessing. For the preprocessing 
phase, where the hash table is created 
from the model set, the algorithm has 
time complexity 0 (M log n) if Mn 3 
processors are used on a concurrent- 
read exclusive-write SIMD Hypercube. 
We iterate sequentially over the M 
models. Initially, each of the low-order 
n processors contains the (x,y) coordi¬ 
nates of one of the n points in S m for the 
mth model. The set ( S,„ x S m ) x S m is then 
computed using the triple-product al¬ 
gorithm (see the sidebar, “Building- 
block algorithms,”on the next page). 
Each of the first n 3 processors of set V 2 
now contains a triplet: [(x„ y,), (x ,, yj), 
(. x k , y*)] e (S m x S m ) x S m . The first two 
points of each triplet define an ordered 
basis and thus a coordinate system. Ac¬ 
cordingly, each processor with a rea¬ 
sonable basis pair and a distinct third 
point can compute the coordinates of 
the third point of the triplet relative to 
that coordinate system. (Note that some 
processors will be turned off at this 
point.) These coordinates can then be 
converted to a hash bin number. This 
phase is completed once each such pro¬ 
cessor communicates its information to 
the appropriate hash bin. 

To efficiently communicate model and 
basis-point information to the hash bins, 
the algorithm performs two passes 
through the models. In the first pass, it 
counts the number of entries that will 
occur in each hash bin. Each processor 
with an entry destined for a hash bin 
sends an additive write with increment 
1 to an accumulator in that bin. By 
performing a parallel prefix sum of the 
resulting counts, the processor pool can 
be organized into a one-dimensional 
array so that each hash bin occupies a 
contiguous block of (virtual) proces¬ 
sors and the block length equals the 
number of expected entries. Further, a 
map gives the index of the head proces¬ 
sor for each block of processors repre¬ 
senting a hash bin. 

In the second pass, we again iterate 
sequentially over the models. This time, 
for each processor that computes a hash 
table entry, the information is sent to 
the appropriate processor in the hash 
table group. The location of the appro¬ 
priate processor can be maintained by 



Figure B. Coordinate system for normalizing models us¬ 
ing two-point bases. 
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Building-block algorithms 


Certain building-block algorithms are 
fundamental to the programming of a 
hypercube-based SIMD architecture. 
We will need the following components. 

Triple product. Given three finite 
sets A = {a,}-!,, B = {b t }%, and C = 
{c k }>,„ the triple product Ax Bx C 
is the set of all the ordered triplets 
{(a, 

One way to compute the triple prod¬ 
uct is to perform an outer product 
twice. An outer product for the Con¬ 
nection Machine is succinctly de¬ 
scribed in Little, Blelloch, and Cass. 6 
An extension of the method leads to a 
direct triple product computation, 
which we now describe. 

Using standard gray-code embed¬ 
ding algorithms, we configure the 
hypercube as a three-dimensional ar¬ 
ray of size L, x 4 x 4. (We assume, 
purely for convenience, that the L, are 
powers of 2.) The processors are in¬ 
dexed by their coordinates (/, j, k), and 
initially data element a, is contained 
in processor (/, 0, 0), bj in processor 
(0, j, 0), and c k in processor (0, 0, k). 

The algorithm has two phases. Dur¬ 
ing the first phase, the a, data is 
spread along a row in the direction of 
the y axis, the b t data is spread in the 
direction of the z axis, and the c k data 
is spread in the direction of the x axis 
(see Figure C). In the second phase, 
the data on each plane is spread into 
the entire cube, first spreading the 
data on the (x, y) plane along the z 
axis, then the data on the (y, z) plane 
along the xaxis, and finally the data 
on the (x, z) plane along the y axis. 
Upon completion, processor (/', j, k) will 
have received datum a, from (/, j, 0), 


datum bj from processor (0, j, k), and 
datum c k from processor (/', 0, k) and thus 
has the triple product element (a,, 4 c k ). 

The operation of spreading data along a 
single axis that occurs during both phases 
can clearly be performed in O (/_,) time, 
since nearest neighbors are adjacent in 
the hypercube, but can in fact be complet¬ 
ed in O(log L,) time. This is because we 
may use a recursive doubling scheme to 
spread the data rapidly along the axis. 
(Algorithms of this kind are described by 
Hillis and Steele. 7 ) In the parlance of the 
Connection Machine’s Paris language, the 
operation is a scan_with_copy. Power-of- 
two communication along each axis is 
provided by 0(1) communication cycles 
due to the gray-code embedding. Specifi¬ 
cally, If g(i), i= 0,1. n - 1, is a gray 

code (n a power of 2), then it can be 
shown that g(i) and g((i + 2 k ) mod n) dif¬ 
fer in at most two bits, and thus can be 
connected by two communication cycles 
on a hypercube.This is true for any value 
of k. 

Histogramming. Given a collection of 
data (a,}4, such that each a, is an ele¬ 
ment of a finite collection of possible val¬ 
ues, say a, e {1, 2. V], the histogram 

is a count of the number of elements 
equal to each possible output value, that 
is, H(k) = # {/1 a, = k). 

Little, Blelloch, and Cass 6 describe 
three approaches to histogramming: se¬ 
quential iteration through the value set, 
additive writes, and sorting. Let us consid¬ 
er the sorting method in more detail. 

We first sort the data so that a„ w forms 
a nondecreasing sequence. For example, 
the Batcher bitonic sort algorithm 8 oper¬ 
ates on a hypercube machine in O(log 2 /V) 
time. After sorting, each processor can 


determine if the data in the processor to 
its left is different. If so, it marks it¬ 
self as the head of a constant-data 
block. Since each processor needs to 
be able to communicate with its neigh¬ 
boring processor for this step, the pro¬ 
cessors should be configured as a 
one-dimensional array embedded in 
the hypercube, using a gray-code em¬ 
bedding (the Batcher sort process is 
still efficient in this configuration). 

Next, each head processor counts the 
number of processors in its constant- 
data block by means of a segmented 
parallel prefix sum. Finally, each head 
processor sends the information about 
the cardinality of its block to the appro¬ 
priate histogram bin, a„ w . Since the 
destinations of the messages are dis¬ 
tinct and ordered relative to the source 
indices, these messages can be sent 
using an 0(log N+ log V) contention- 
free algorithm of the sort described by 
Nassimi and Sahni. 9 The total com¬ 
plexity of histogramming by sorting is 
thus 0(log 2 N + log V). 

For our purposes, the histogram 
vector is not needed; rather, we only 
need knowledge of the few maximum- 
vote-getting values. To this end, the 
final stage of sending messages can 
be omitted, and the maximum counts 
among the marked processors can be 
determined and relayed to the front 
end. Thus, the process of finding the 
few maximum histogram bins can be 
accomplished in 0(log 2 N) time. 

Further, we can do better than the 
Batcher sorting algorithm. Lin and Ku¬ 
mar 10 provide a hypercube-based radix 
sort algorithm; in the sidebar entitled 
“Simple radix sort on a hypercube,” we 
outline a simpler method that also has 
time complexity O(log Vx log N). 
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Figure 1. Recognition phase of the parallel geometric hashing connectionist algorithm. Note how tokens flow from one 
set via connections to the next set. 


incrementing the map entry pointing to 
the head of the corresponding hash bin’s 
block. Collisions can occur if more than 
one processor wishes to append to a 
single hash bin’s list. By using a SIMD 
version of a parallel fetch-and-add in¬ 
struction, each processor requesting a 
destination virtual processor in a hash 
table bin is assured a distinct address 
while indivisibly (and concurrently) in¬ 
crementing the counter pointing to the 
next empty location of the block. 

The hash table is now contained in 
two data structures. The first structure, 
one processor for each hash bin h, con¬ 
tains pointers to a head processor T h of 
the block of entries in the second data 
structure. The second data structure 
consists of at most Mn(n-l)(n-2 ) hash 
bin entries of the form (m, i, j). 


Recognition. In Figure 1, a schematic 
diagram of the recognition phase, we 
use the virtual processor set (VP set) 
concept found in Connection Machine 
literature. A VP set is simply an ab¬ 
stract finite set that will be mapped to 
virtual processors. The VP sets in the 
recognition phase are the feature coor¬ 
dinate set V x and the hash table sets V 2 
and V 3 . We assume that a set of S inter¬ 
est points have been extracted from the 
scene and that each coordinate pair re¬ 
sides in the local memory of one of the 
S processors of the VP set V t . For the 
hash table, VP set V 2 contains the point¬ 
ers to the heads of the blocks of entries, 
and V 3 is the one-dimensional array of 
concatenated lists of hash entries. 

In the first stage, the front end selects 
a basis pair in the scene and broadcasts 


the pair’s coordinates to the S proces¬ 
sors of V x . Each processor in V x com¬ 
bines the coordinates of its interest point 
with the broadcast pair to compute the 
index of a hash bin. In the second stage, 
messages saying “you receive one vote” 
are sent by the processors of V, to the 
appropriate processors in V 2 . The mes¬ 
sages are sent using additive writes and 
general routing; multiple votes destined 
for the same recipient processor com¬ 
bine in the routers. In the last stage, 
every V 2 processor h that received one 
or more messages relays the number of 
votes it received to the block of proces¬ 
sors T h through T M - 1 of V y This 
operation can be done, for example, 
using a modified version of Nassimi and 
Sahni’s Generalize algorithm. 9 Alter¬ 
natively, every V 2 processor h can send. 
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Simple radix sort on a hypercube 


Assume that the values in the sequence to be sorted, {a,}^,, 
are represented in binary bit form, and let [b k i }')L 1 be the se¬ 
quence of the kth-from-the-right bits. We sort the values in a 
stable fashion. 

For k beginning at zero, and successively increasing to 
log V- 1, we do the following: 

• Mark all processors with b kJ = 0. 

• Rank these processors: Each marked processor deter¬ 
mines its relative position among all marked processors us¬ 
ing a parallel prefix sum (Nassimi and Sahni 9 describe a 
Rank algorithm). Let r t be the rank of a processor if it is 
marked and t be the maximum r,. 

• Mark all processors with b kJ = 1. 

• Rank these processors as well; let s, be the rank of the 
/th such processor. 

• Move the a ; data: Every processor with b kj = 1 sends its 
data a, to processor f+ s„ while every processor with b ki = 0 
sends its data to processor r,. Because the paths of com¬ 
munication are ordered, this routing can be completed in 

O (log N) time, using the Concentrate and Distribute algo¬ 
rithms from Nassimi and Sahni. 9 



After the first iteration, all items are stably sorted with re¬ 
spect to their low-order bit. Upon termination, the sequence 
{a,}^., will be sorted. 


a message containing the number of 
votes (which might be zero) to proces¬ 
sor T h in V 3 . Using a parallel prefix 
computation with “copy from the left” 
as the binary associative operator, pro¬ 
cessor T h can then spread the count to 
the remaining members of its group. 

At this point, we want to histogram 
the entries of the processors in set V 3 
using the multiplicities determined in 
the previous step. Efficient histogram- 
ming methods should be utilized; in 
particular, the radix sort algorithm (see 
sidebar above) offers advantages. In our 
current implementation, purely for ease 
of coding, we use additive writes in¬ 
stead. Consequently, a fourth VP set, 
the histogram bin set V 4 , is required. 
Each processor of V 4 is associated with 
one histogram bin representing a triplet 
( m, i, j). The processors of V 3 vote for 
their ( m, i,j) entries by sending an addi¬ 
tive write message to the appropriate 
histogram bin. The increment in these 
messages is the value of the votes they 
received in the algorithm’s third stage. 
The additive writes combine in the rout¬ 
ers, resulting in histogram counts in the 
Mn(n -1) histogram bins. Lastly, a glo¬ 
bal-max operation (or a thresholding) 
of the vote tallies over the processors of 
the set V 4 recovers the winning ( m, i, j) 
combinations. These combinations are 
communicated to the front end to verify 
the existence of matching models. 

For the time complexity of the recog¬ 


nition phase, we assume that Mn 3 pro¬ 
cessors are available and that the num¬ 
ber of hash table bins is less than or 
equal to Mn 3 . The time complexity of 
the recognition phase, per broadcast 
basis pair, is dominated by the histo- 
gramming step. In fact, the time com¬ 
plexity of the remaining operations of 
the recognition phase is no worse than 
0(log(Mn 3 )), which is the same as 
0(log(Mn)). The complexity of histo- 
gramming depends on the particular 
method. Using Batcher’s bitonic sort 9 
for histogramming, the time complexity 
of the recognition phase is 0(log 2 (5Mn)), 
where S is the number of feature points. 
Using the radix sort algorithm lowers it 
to 0(log(SMn) log (Mn)). 

Hash-location 
broadcast algorithm 

The above algorithm approaches geo¬ 
metric hashing as a connectionist pro¬ 
cess, with information flowing via pat¬ 
terns of communication. Our second 
algorithm uses the Connection Machine 
as an intelligent memory source and is 
inspired by the inverse indexing meth¬ 
od of data retrieval. 

Hash-table data structure. The hash 
table is organized differently in this al¬ 
gorithm. The data can be regarded as a 


collection of records of the form ( m , i,j, 
k, x, y), where (x, y) is the hash location 
that point k maps to under basis (/',;') in 
model m. The information can be stored 
in a multidimensional table indexed by 
(m, i, j, k), where i, j, and k are integers 
between 1 and n. Not all of the Mn 3 
array locations will be used. Locations 
where point k occurs in the basis (i, j ) 
will be empty, and all bins correspond¬ 
ing to poor basis combinations (a basis 
with two very close points) will also be 
empty. The (m, i, j, k) information in 
each record can be recovered from the 
self-index of the entry location in the 
array, although we found it convenient 
to store the data explicitly along with 
the (x, y) values. 

Preprocessing. The construction of 
the hash table is a simplification of the 
previous algorithm’s preprocessing 
phase. After loading the model point 
data into the appropriate processors, M 
simultaneous triple products of S m x S m 
x S m are computed for m = 1,2,..., M in 
0(\og n) time. This forms the four¬ 
dimensional array. Then every location 
that can compute a reasonable hash lo¬ 
cation computes the corresponding 
( x , y) value and stores the information 
locally. 

Recognition. Figure 2 shows a sche¬ 
matic diagram of the recognition phase. 
The VP set V x contains the feature coor- 
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dinate set, obtained from the 5 interest 
points (5 is typically 200) that have been 
extracted from the scene, and V 2 con¬ 
tains the four-dimensional hash-table 
data structure preloaded into the mem¬ 
ory. 

We operate sequentially through the 
basis sets and the scene points. First, the 
front end selects a basis pair in the scene 
and broadcasts the coordinates of the 
points in the pair to the 5 processors of 
V,. The two processors whose interest 
points form the selected basis do not 
participate in the coordinate computa¬ 
tion; the remaining 5-2 processors 
then compute the hash location of the 
locally stored scene point in the coordi¬ 
nate frame of the broadcast basis. These 
operations involve minimal data move¬ 
ment and thus are extremely fast. 

In the second stage, the data from the 
5-2 processors in V, are successively 
broadcast to all processors of set V 2 . 
Each broadcast coordinate has the form 
(u,v) and gives a hash table location 
where a vote should be tallied. Each 
processor in V 2 , indexed by ( m, i,j, k), 
contains a hash location ( x , y), which 
the processor can compare against (m,v). 


If the two locations are sufficiently close, 
table location (m, i, j, k) records a hit 
indicating a vote for model m and basis 
( i , j). (An extremely useful modifica¬ 
tion to geometric hashing permits 
weighted voting for model-basis pairs 
according to the relative proximity of 
(m,v) to (x, y).) The vote tallying contin¬ 
ues by accumulating hits in each hash 
table location for each of the 5-2 
points in the image. 

When the tallying is complete, a third 
stage uses a segmented, parallel, tree- 
based sum operation to add the votes 
over k among locations (m, i,j, k). The 
result is the total number of votes that 
model m with basis (i, j) obtains for the 
given scene and basis selection. Finally, 
a global-max or thresholding operation 
among the processors with locations 
holding the sum of votes determines the 
winning model/basis combinations. A 
final verification step determines the 
quality of each match. 

Strictly speaking, each of the 5-2 
broadcasts will require 0(log(Mn 3 )) 
time, since there are Mn 3 processors in 
the V 2 data set. However, the theoreti¬ 
cal complexity can be decreased, at the 


expense of requiring 5 storage locations 
in each processor, assuming that 5 < 
Mn 3 . Assuming for simplicity that 5 = 
n 2 , all 5 - 2 broadcasts can be done 
simultaneously by having each proces¬ 
sor in Vi send its data to a unique pro¬ 
cessor in a two-dimensional (n x ri) slice 
of the four-dimensional data set V 2 . This 
routing can be completed in time 
0(log(n 2 )). This slice of data can then 
be spread to the rest of V 2 , in parallel 
slices, requiring no more than 0(log 
Mn) time. 

Observe that at this point the entire 
set of the computed coordinate pairs is 
distributed among the n 2 processors of a 
slice, one coordinate pair per processor. 
The processors within a slice can now 
exchange their data so that the entire 
list of computed coordinate pairs be¬ 
comes available to each of them. This 
can be achieved simply by a recursive 
doubling procedure that communicates 
data between pairs of processors and 
forms lists of coordinate pairs. Note 
that the entries of those lists will not 
appear in the same order in each pro¬ 
cessor. This recursive doubling proce¬ 
dure can be completed in 0(5) time. 
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Figure 3. Hash table equalization for similarity transformations. The model points have a Gaussian distribution 
Shown at left, the hash table before equalization; at right, the hash table after equalization. 


The time complexity of the entire 
recognition phase is dominated by the 
second stage. Indeed, the time com¬ 
plexity of the first stage is 0(log S). The 
second stage, using the data-spreading 
trick just described, results in time com¬ 
plexity no worse than 0(S + log(Mn)), 
which is also the complexity of the rec¬ 
ognition phase. 

Implementation on a 
Connection Machine 

Implementing carefully crafted par¬ 
allel algorithms on existing architectures 
frequently involves more compromises 
than one might expect. In our case, our 
algorithms have assumed the existence 
of Mn 3 processors, which for M = 1,024 
and n = 16 would entail a 4-million- 
processor machine. Although there are 
64K-processor Connection Machines, it 
is more usual to have access to a 32K- or 
16K-processor model and to do proto¬ 
typing on an 8K-processor model. We 
can use Connection Machine software 
facilities to simulate a larger parallel 
machine by mapping multiple virtual 
processors to each physical processor, 
which then execute in round-robin fash¬ 
ion. However, the virtual processor ra¬ 
tio (VPR) would then equal 512 on an 
8K-processor model, which incurs an 
impractical amount of overhead. Ac¬ 
cordingly, we must modify the algo¬ 
rithms somewhat and employ other ef¬ 
ficiencies. 

Hashing connectionist algorithm. 

Rather than give each hash table entry 
a separate processor, we can store the 


entire list of entries for a hash bin in a 
single processor’s local memory. The 
lengths of the lists will vary over the 
hash table, but the required number of 
processors drops to the number of de¬ 
sired hash bins. The preprocessing phase 
of creating the hash table becomes far 
less efficient, due to the processor’s need 
to randomly access local memory as 
entries are appended to the lists. Fur¬ 
ther, collision contention becomes more 
delicate. But, provided no single list 
becomes exorbitantly long, memory re¬ 
quirements are not a problem. 

For the recognition phase, the entries 
in the hash bins that receive votes must 
be histogramed (that is, counted) with 
the multiplicity of the number of votes 
that each hash bin receives. For 1,024 
models having 16 points each, the en¬ 
tries consist of 18-bit codes (10 bits for 
the model number, and 4 bits for each of 
the two basis points). Rather than histo- 
gramming by sorting, we opt for a mes¬ 
sage-passing strategy. We set up 2 18 = 
256K buckets (which requires a virtual 
processor ratio of 32 on the 8K-proces- 
sor machine), one bucket for each 
(m, i,j ) combination. Each hash bin that 
receives one or more votes from the 
scene points then synchronously walks 
down its list of entries, sending messag¬ 
es to the corresponding (m, i,j) buckets. 
On the 8K-processor machine, each hash 
bin has, on the average, 420 entries in its 
list. The time needed for list traversal is 
clearly dominated by the longest list. 
This process currently accounts for 99 
percent of recognition-phase execution 
time, and it uses the parallel architec¬ 
ture less efficiently than the radix-sort 
approach to histogramming. 

To make the process as efficient as 


possible, we suggest two enhancements. 
First, it is clearly desirable to keep the 
lists of entries as even as possible over 
the hash bins. By employing a rehashing 
function, we can effectively requantize 
the hash table such that the expected 
density of list lengths becomes uniform. 11 
Figure 3 plots the hash bin occupancies 
before and after requantization for a 
typical database of models. 

Second, we can use certain symme¬ 
tries in the hash table; in particular, if an 
entry of the form (m, i, /') hashes to a 
location (x, y), then there will be an 
entry ( m,j , i) in location (-*, -y). Thus, 
when calculating a point’s hash loca¬ 
tion, we can remap points from the low¬ 
er half plane to the upper half plane, 
and either confuse entries of the form 
(m, i,j) and ( m,j , i) (which could cause 
some degradation in discriminability) 
or mark such remapped hashings as 
“basis-inverted.” Accordingly, only half 
the hash table is required, and the entry 
lists become, on the average, half as 
long when spread over the existing pro¬ 
cessor set. 

Hash-location broadcast algorithm. 

To reduce the virtual processor ratio in 
the second algorithm’s implementation, 
we assign one processor to each index 
(m, i,j) and store the n entries associat¬ 
ed with fc = 1,2,...,«in its local memory. 
The computed hash location for each of 
the S scene points is broadcast to all 
processors, and each processor com¬ 
pares the location with the n locations 
stored in its local memory. Thus, Sn com¬ 
parisons are needed (per basis probe). 

Efficiencies can be achieved by the 
symmetries mentioned above. Each of 
the entries ( m, i,j, k, x, y) is mirrored by 
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an entry of the form ( m,j , i, k, -x, -y), so 
by comparing a broadcast location (h,v), 
where v < 0, with (-x, -y) instead of (x, 
y), we can omit half of the hash table 
entries. Further, through additional pre¬ 
processing of the local hash table en¬ 
tries, and by broadcasting the 5 points 
in an appropriate order, processors can 
use a two-dimensional version of list 
merging and avoid looking through the 
entire lists. This enhancement, which 
requires nonuniform random access to 
local processor memory (supported in a 
limited fashion on the Connection Ma¬ 
chine), reduces the theoretical complex¬ 
ity to 0(5 + n) but does incur other 
overhead. 

Languages. Although a number of 
special-purpose parallel languages have 
been developed for the Connection 
Machine, we found C code running on 
the front end, enhanced with system 
calls to the Connection Machine using 
its Paris package, the most suitable for 
our needs. The Paris package includes 
many of the building-block and routing 
algorithms that we have mentioned; thus, 
it gives us the greatest level of control 
over the machine. For the broadcast- 
based algorithm, just about any lan¬ 
guage would suffice, and the Paris prim¬ 
itives represent a fast development path. 


Performance results 

We generate models (dot patterns) of 
16 points each, using either a uniform 
distribution over a region or a Gaussian 
distribution. After generating 1,024 
models, we construct scenes of approx¬ 
imately 200 points, such as those shown 
in Figure 4. A single model is embedded 
in the scene, translated, rotated, and 
scaled. Noise is added to the scene points 
through quantization round-off error. 

In both implementations, the front 
end randomly selects a pair of scene 
points (a probe) as the basis for possible 
recognition. A connectionist algorithm 
probe takes 1.52 seconds on an 8K-pro- 
cessor machine, which drops to 0.24 sec¬ 
onds on a 32K machine, using rehashing 
but not the symmetries described in the 
preceding section. If the symmetries 
were used, the probe time would drop 
accordingly. The plots in Figure 5 show 
the connectionist algorithm operating 
in a roughly linear regime — that is, we 
are achieving linear speedup due to the 
heavy loading. In fact, as the number of 



Figure 4. Recognition examples for the connectionist algorithm (top) and the 
broadcast algorithm. In the center panels, the embedded model is shown in sol¬ 
id dots, and the randomly selected basis points that resulted in recognition are 
shown boxed. Full recognition required 209 probes in the top example, 150 in the 
lower one. The database contained 1,024 models. 



Figure 5. Average time required for a single basis probe, using 1,024 models of 
16 points each, as a function of the number of processors in the Connection Ma¬ 
chine; the scenes contain 200 points. 
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processors increases, reduced conten¬ 
tion in the routing algorithm gives us, in 
some cases, an apparent extra boost. 
But such improvements would not con¬ 
tinue forever. Figure 4 shows some rec¬ 
ognition examples. 

In the broadcast algorithm, the 8K- 
processor machine processes a probe at 
a rate of 10 milliseconds per scene point, 
that is, approximately 2.0 seconds for a 
probe using a 200-point scene. Neither 
symmetries nor list merging efficiencies 
were employed. Experiments with a 16K- 
and 32K-processor model indicate nearly 
linear increases in speed (see Figure 5), 
so a 64K-processor machine should be 
able to perform a probe in about 300 
milliseconds without the use of symme¬ 
tries. 

By way of comparison, both algo¬ 
rithms are easily coded on a typical 
high-performance workstation. Perfor¬ 
mance results are highly dependent on 
disk access rates and available memory, 
but we have seen probe times of roughly 
35 seconds for the equivalent of the 
hash-location broadcast algorithm on a 
Sun Sparcstation-2. (Remember that 
each of the processors in a Connection 
Machine is merely a slow bit-serial 
ALU.) 


R ecognizing all models embed¬ 
ded in a scene, if there are 
many, would require many 
probes. Achieving processing times of 
seconds instead of minutes would re¬ 
quire (1) access to a large parallel ma¬ 
chine, (2) smart methods for choosing 
basis pairs in the scene, and (3) further 
performance enhancements. In the third 
area, we could make better use of sym¬ 
metries and employ foldings, where mul¬ 
tiple model/basis combinations are coa¬ 
lesced into a single bucket, to reduce 
the time for histogramming. 

Both algorithms exhibit sublinear 
growth in execution time as the number 
of models M increases (using O(M) pro¬ 
cessors), which is a hallmark of geomet¬ 
ric hashing methods. However, the con- 
nectionist algorithm has better 
asymptotics as the number of points S in 
a scene increases, and it performed slight¬ 
ly faster in our implementations with 
200-point scenes. The hash-location 
broadcast algorithm is simpler, and more 
amenable to improvements, and should 
ultimately prove superior when the num¬ 
ber of scene points is not too large. ■ 
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Dynamic Control and 
Prototyping of Parallel 
Algorithms for 
Intermediate- and 
High-Level Vision 


A.M. Wallace, G.J. Michaelson, P. McAndrew, K.G. Waugh, and W.J. Austin 
Heriot-Watt University 


The authors present 
recent work that 
exploits MIMD 
architectures for 
problems in 
intermediate- and 
high-level vision in 
computers. They 
emphasize the 
inherent potential for 
dynamic control of 
algorithms on the basis 
of partial results. 


D eveloping parallel algorithms for intermediate and high levels of comput¬ 
er vision systems is much more difficult than developing algorithms for 
low-level iconic image processing. Not only are the algorithms more 
complex, but the nature and size of the input and output data sets vary for each 
application. Consequently, we have directly developed parallel algorithms for 
dynamic control of both processing and communication complexity during execu¬ 
tion. We have also examined the merits of functional prototyping and transforming 
programs into imperative execution code for final implementation. To evaluate 
and give direction to our work, we have implemented algorithms for plane 
detection and object recognition on a flexible transputer network. 


Parallel processing and computer vision 

Low-level processing of image data consists principally of static pixel-to-pixel 
transformations like median filtering and edge detection by gradient operators. 
Since these algorithms are fully predetermined and act identically on all input data 
— and the initial and final data structures are fixed arrays — the algorithms are 
ideally suited to single instruction, multiple data (SIMD) machines. Data parallel¬ 
ism is achieved readily with near-linear increases in processing speed. 

Iconic data processing is commonly followed by a segmentation process that 
extracts features corresponding to those used to form object models. For example, 
two-dimensional edge data may be combined into extended features such as lines 
or arcs. Although the input data structure is static, the output data may vary in size 
(the number of anticipated features is unknown). 

Segmentation is the critical stage in a computer vision system because it links the 
relatively well understood low-level processes with the primitives for describing 
the scene in a compact and meaningful way. 

After segmentation, the scene description is matched against preformed models 
to develop a consistent labeling. Here, the data structures are not fixed in size, and 


February 1992 


B-9162/92/0200-0043$03.00 © 1992 ] 


43 







no simple one-to-one or many-to-one 
fixed and even distribution of data to 
processors exists. Although the data 
structures are relatively small, sequen¬ 
tial algorithms developed for image in¬ 
terpretation 1 are dependent exponen¬ 
tially on the number of these features 
unless some form of heuristic control is 
included. Thus, parallel implementation 
of these algorithms is more suited to a 
multiple instruction, multiple data 
(MIMD) architecture. This raises addi¬ 
tional problems like allocating proces¬ 
sors to the various subprocesses of fea¬ 
ture selection, matching scene and model 
features, evaluating the selected hypoth¬ 
esis, and dynamically reallocating these 
processors as partial results are evalu¬ 
ated. In the last case, data parallelism 
(allocating more processors to specific 
parts of the scene) or algorithmic paral¬ 
lelism (allocating more processors to 
particular subprocesses like evaluation) 
may be employed. 

The two principal approaches to seg¬ 
mentation and model matching are based 
either on transformation from image 
space to parameter space or on “hy¬ 
pothesis and verification.” 1 We apply 
the first approach to plane detection in 
depth data and apply the latter approach 
to object recognition, although we could 
have equally chosen the reverse strate¬ 
gy- 

Whichever processing strategy is used, 
visual processing is affected greatly by 
heuristic control and the setting of 
thresholds. For mapping and accumula¬ 
tion in parameter space, the key thresh¬ 
old is that which determines the pres¬ 
ence of a peak, and hence a primitive or 
object in image space. For hypothesis 
and verification, the key threshold is 
the tolerance that is placed on a com¬ 
patible point in a plane or primitive in a 
model. Because parallel processing is 
our main interest and these problems 
have been discussed fully in a previous 
review, 1 we refer to them briefly, while 
acknowledging the significant difference 
they can make to presented results. 

Approaches to parallel- 
algorithm development 

There are three approaches to devel¬ 
oping parallel algorithms: direct imple¬ 
mentation, automated parallelism, and 
program transformation. In direct im¬ 
plementation, the programmer employs 


a language specifically designed for a 
parallel environment. Parallelism must 
be identified explicitly and the languag¬ 
es must be relatively low level to mirror 
the underlying machine architecture. 
Although the direct approach often leads 
to the most efficient implementation, 
the need to program the underlying ar¬ 
chitecture often complicates the pro¬ 
cess. 

For automated parallelism, the pro¬ 
grammer ignores the parallelism in the 
algorithm and employs a sequential lan¬ 
guage. When a satisfactory sequential 
implementation is achieved, the pro¬ 
gram is analyzed statically to identify 
code fragments that can be safely exe¬ 
cuted concurrently. This involves the 
detection of such things as access to 
global variables that must be protected 
(read/write races) and the need for com¬ 
munication synchronization. The se¬ 
quential program is then translated into 
a parallel program using rules that em¬ 
ploy the results from the static analysis. 
Most translation systems are based on 
generating an intermediate control and 
data graph to identify “abstractions” 
with equivalent parallel implementa¬ 
tions that are selected from a library of 
generic code fragments. The transla¬ 
tion phase can be used to hide the ma¬ 
chine architecture from the program¬ 
mer so that a different translation 
strategy can be employed in different 
hardware environments. 

Automatic translation is generally 
most useful in SIMD parallel architec¬ 
tures, where the translation results in 
replication of code applied to indepen¬ 
dent data. However, sequential pro¬ 
grams are often less amenable to direct 
parallelization for an MIMD environ¬ 
ment. Splitting a program into indepen¬ 
dent parallel units involves identifying 
intermodule dependencies to determine 
where parallel segmentation is safe. This 
is a complex and costly activity due to 
the use of assignment and shared data. 2 
Furthermore, during program segmen¬ 
tation, it is hard to tell whether the gains 
from identified parallelism offset the 
losses due to additional interprocess 
communication. Typically, parallel code 
produced by automatic translation is 
inefficient, retaining a basically sequen¬ 
tial structure as an artifact of the origi¬ 
nal style. 

For program transformation, 3 the pro¬ 
grammer employs parallel constructs in 
writing an initial version of the program 
without attempting to produce efficient 


parallel code. A rewrite system statical¬ 
ly analyzes this code and attempts to 
improve it by increasing parallelism or 
ensuring a balanced program execution. 
This is generally more successful than 
direct, sequential-to-parallel translation 
because the original code has been writ¬ 
ten in a parallel style. The programmer 
is not concerned with the fine details of 
balancing the program or obtaining the 
maximum parallelism from the algo¬ 
rithm. Such systems often include dy¬ 
namic analysis of the program in an 
attempt to identify data-dependent tim¬ 
ings or critical limits during actual exe¬ 
cution. The availability of the program¬ 
mer’s “first approximation” to the final 
code is used for this purpose. 

We have concentrated on the first 
and third methods, using Occam 2 as 
the implementation language in each 
case. This imperative programming lan¬ 
guage with explicit parallel control con¬ 
structs is based on the Communication 
Sequential Processes (CSP) formalism. 
It was designed primarily for program¬ 
ming the Inmos transputer, a high- 
performance von Neumann computer 
intended for multiprocessor configura¬ 
tions. Occam 2 provides integer, real, 
Boolean, and byte atomic data types 
that may be structured as multidimen¬ 
sional arrays. The basic control con¬ 
struct is assignment, which may be struc¬ 
tured through traditional sequences, 
selection, and replication. These se¬ 
quences are augmented with equivalent 
constructs for the explicit expression of 
parallelism. 

Occam 2 programs are composed of 
independent processes that communi¬ 
cate through message passing. This stat¬ 
ic parallel language lacks features for 
recursion and dynamic process creation. 
However, an Occam 2 program can be 
abstracted to a graph of connected pro¬ 
cesses that may be analyzed to elucidate 
data and control flows. 

A number of common parallel strate¬ 
gies are employed in imperative pro¬ 
gramming, and Occam 2 frequently uses 
them in algorithms. Figure 1 schemati¬ 
cally illustrates the options of vertical, 
fixed-width horizontal, and variable- 
width horizontal parallelism. Vertical 
parallelism occurs when each element 
of a data-item sequence is processed by 
a number of processes in turn. The pro¬ 
cessing of an element at one stage may 
be overlapped with the processing of 
another element at another stage. Fig¬ 
ure la shows the resultant pipelined 
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Figure 1. Common imperative parallel-programming strategies frequently used 
in Occam 2: vertical (a), fixed-width horizontal (b), and variable-width horizon¬ 
tal (c). 


processes. Fixed-width horizontal par¬ 
allelism is used when a known number 
of data items are to be processed inde¬ 
pendently and the results combined. 
Figure lb shows a tree-like graph of 
execution and combination, which is 
used when width corresponds to the 
number of independent data items. Fig¬ 
ure lc demonstrates variable-width hor¬ 
izontal parallelism when an unknown 
number of data items are processed in¬ 
dependently. A process farm is em¬ 
ployed. The farmer process communi¬ 
cates with a number of worker processes, 
passing data to them and combining the 
results. Because the number of worker 
processes is fixed and finite, each work¬ 
er may be reused to ensure that all data 
items are processed. 

Hough transform 

To illustrate direct implementation 
of transformation from image to pa¬ 
rameter space, we consider the Hough 
transform (HT) for feature detection 4 
in two- and three-dimensional image 
data. Most parallel HT implementations 
concentrate on the simplest case of ex¬ 
tracting straight lines from 2D image 
data using the p - 0 parameterization, 
x cos 0 + y sin 0 = p. For detected edge 
points in the image, x and y, and possi¬ 
bly the edge direction 0, are known. 
Therefore it is simple to compute p. If 0 
is not known, p is computed for all pos¬ 
sible, quantized values of 0. Following 
the mapping from image (x, y) to pa¬ 
rameter (p, 0) space, significant peaks 
indicate the presence of straight lines. 
The accuracy of location of the straight 
lines is determined by the resolution of 
the parameter space. 

The HT is expensive, both in compu¬ 
tation and storage requirements, which 
depend on the number of dimensions in 
image and parameter space. Two meth¬ 
ods of achieving data parallelism are 
appropriate. In the first strategy, the 
image space, or set of detected edge 
points, is distributed equally among the 
available processors. Although this pro¬ 
vides efficient computation of the p - 0 
parameters, the parameter “peaks” be¬ 
come distributed throughout the net¬ 
work. Therefore, in the second stage of 
peak detection, significant communica¬ 
tion costs are incurred. 5 In the second 
strategy, the Hough space is partitioned 
equally, and each processor computes 
the p - 0 parameters for all edge data. 


However, it increments the local accu¬ 
mulator only if these parameters lie with¬ 
in range. The redundancy of the compu¬ 
tation is balanced by peak-detection 
efficiency, since this consists simply of 
finding which processor has the most 
incremented points. For a particular 
parallel architecture, the optimum ap¬ 
proach depends on the computation/ 
communication cost ratio. 

Image space is usually restricted to 
two or three dimensions (although scale 
may also be incorporated). Parameter- 
space dimension equals the number of 
parameters in the equation. Generally, 
processors map more easily to 2D spac¬ 
es; for example, the transputer has four 
communication links that map readily 
to a 4-connected image or parameter 
grid. For higher dimensional spaces, the 
processor interconnection network be¬ 
comes more complex, and communica¬ 
tion costs increase. 

We concentrate on 3D image and 
parameter spaces, that is, the detection 
of planes of the form, z = ax + py + y, in 
actively acquired range data. We use a 
(0, <|>, r) parameterization for the plane, 
where the two angles of latitude and 
longitude are in the range 0 < <)> < it/2 and 
0 < 0 < 2jt and r is the normal distance. 



V(ct 2 +p 2 +l) 


The process of transformation is a 
one-to-many mapping from each image 
point ( x , y, z ) into many points in (0, 0, 
r) space. To obtain a one-to-one map¬ 
ping, a local least-squares fit can be 
performed on a local neighborhood to 
determine the values of 0 and 4>, which is 
equivalent to finding a local normal di¬ 
rection. 

We have further modified the basic 
Hough algorithm to exploit the greater 
flexibility of the MIMD architecture. 
There are generally not enough proces¬ 
sors to subdivide the parameter space 
equally for reasonable accuracy in plane 
detection. Because the parameter space 
is sparsely and unequally filled with ac¬ 
cumulated points, we adopted an Adap¬ 
tive Hough Transform (AHT). 6 This 
transform seeks to improve utilization 
of processing power and parameter space 
by adopting initially a very coarse quan¬ 
tization of parameter space, and then 
“focusing” to higher resolution only on 
those regions of parameter space with 
significant accumulations of votes. The 
resulting Hough space may be viewed 
as a pyramid or tree, whose depth is 
determined locally by the number of 
accumulated points with the correct pla¬ 
nar parameters. This structure results 
in a much-reduced accumulator size and 
more efficient allocation of processors. 

We have also adopted a heuristic based 
on the observation that within most 
images, planes are piecewise continu¬ 
ous; that is, points that lie on the same 
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SEQ (* Farmer processor *) 

Broadcast image to workers 

Collect results from each worker and sum into local accumulator 
Detect most significant peaks in accumulator 

PAR i = 1 FOR number of workers (* Worker processors *) 

SEQ 

Read image from farmer 

Accumulate plane parameters for sampled points in i’th image 
section 

Return resulting accumulator to farmer 


Figure 2. Algorithm Al, a parallel algorithm for coarse accumulation in Hough 
space. 


PAR 

(* Farmer process *) 

SEQ 

initialize (peak list) from coarse transformation (Al) 

WHILE not empty (peak list) and workers active 
ALT 

IF worker available THEN remove head (peak list) and send 
to worker 

IF secondary peaks available THEN add to peak list 
IF final peaks available THEN store result 
send terminate message to workers 

(* Worker process *) 

PAR i = 1 FOR number of workers (Worker processors) 

WHILE not (terminated) 

Read work message from farmer 
FOR i = start level TO max focus level 
Focus Hough space to next resolution 
Threshold peaks in accumulator 
IF not (max focus level) THEN 

retain principal peak and return secondary peaks to 
farmer 

ELSE return final peaks to farmer 


Figure 3. Algorithm A2, a parallel algorithm for iterative refinement of Hough 
space. 


plane in image space are generally adja¬ 
cent. Thus a controlled sampling proce¬ 
dure can provide initial estimates of 
point distribution in Hough space. 

The parallel AHT proceeds in two 
stages: Hough transformation into a 
coarsely quantized 5x5x5 accumula¬ 
tor, followed by iterative refinement of 
the most significant peaks so detected. 
At each stage, a single farmer process 


coordinates the actions of a number of 
worker processes. At stage 1, each work¬ 
er performs a standard HT on a sample 
of data points (typically 10 to 20 per¬ 
cent) within its own section of the depth 
image. The accumulators are then re¬ 
turned to the farmer, which combines 
them and detects the significant peaks. 
This geometric image decomposition 
provides priming information for the 


more interesting stage 2. If the cost of 
communicating the accumulators to the 
farmer outweighs the parallel advan¬ 
tage of decomposition (as can be the 
case for small images), coarse accumu¬ 
lation may be performed by the farmer. 

Algorithms Al and A2, shown in Fig¬ 
ures 2 and 3, contain pseudocode for the 
coarse and focusing stages. In each case, 
the first branch of the outer PAR con¬ 
struct represents processing performed 
on the farmer, and the second branch 
contains a replicated PAR construct 
representing processing on a number of 
workers. 

At stage 2, each worker accepts an 
accumulator peak from the farmer and 
refines it iteratively to a predefined res¬ 
olution before returning its result to the 
farmer. At each iteration, the accumu¬ 
lator limits are set by the previous iter¬ 
ation, the range of possible values being 
reduced as the resolution increases. Only 
the most significant peak is retained for 
local processing; at the next iteration, 
other peaks are returned to the farmer 
process, where they are queued for fu¬ 
ture refinement. Positional information 
is accumulated for each parameter cell, 
allowing another level of focusing, this 
time in image space. Farming continues 
until no more peaks are left and there 
are no active workers. A short message 
is exchanged between the worker and 
the farmer each time a parameter peak 
is refined, whereas in traditional ap¬ 
proaches messages are required for each 
point examined. 

We implemented two versions of al¬ 
gorithm AHT. In the first, local surface 
normals were calculated to give a one- 
to-one mapping from image space to 
parameter space. To retain the parallel 
advantage, the image space was sam¬ 
pled during stage 1, and only eight di¬ 
rections from a peak’s center in the 
image were considered during focusing. 
Because the surface normals were found 
to be sensitive to image noise, a second, 
more naive, version was developed in 
which each point was considered sepa¬ 
rately with no image sampling. This 
method gives a one-to-many mapping 
from image space to parameter space. 

It should be noted that heuristics are 
employed to evaluate the parallel AHT. 
A threshold for significant peaks in 
Hough space is set. 

Points in image space were sampled 
at both stages of the algorithm. Figure 4 
presents speedup (the ratio of execu¬ 
tion time on one to n processors) and 
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Figure 4. Execution time and speedup characteristics for the surface-normal version (a) and the naive version (b) of the 
Adaptive Hough Transform. 


execution time for a test image consist¬ 
ing of 16 rectangular planes of varying 
size. Figure 4a shows that speedup for 
the surface-normal version is 
restricted as the number of 
processors increases, falling 
quickly from the ideal straight 
line. Figure 4b shows that the 
speedup characteristic for the 
naive version is much closer 
to the ideal, although execu¬ 
tion time has increased sig¬ 
nificantly. Ideally, one pa¬ 
rameter peak occurs for each 
image plane. If the number 
of workers exceeds the num¬ 
ber of planes (which is influ¬ 
enced strongly by the thresh¬ 
old), some processors remain 
idle and contribute nothing 
to overall speedup. 

As Figure 5 shows, when 
the number of planes exceeds 
the number of processors, 
speedup approaches the op¬ 
timum in the 10-processor 
case. In the 17-processor case, 
saturation occurs at a ratio 
of 13:1, which indicates the 
relatively lower algorithmic 
complexity of the surface- 
normal version. The commu¬ 
nication overhead becomes 
more significant as the num¬ 
ber of worker-to-farmer mes¬ 
sages increases. The one-to- 
many mapping of the naive 
algorithm results in consid¬ 
erably more computations of 
planar parameters and a 
more densely filled parame¬ 
ter space. 

In Figure 4b, the speedup 
characteristic is close to lin¬ 
ear. However, the process¬ 


ing times for the surface-normal ver¬ 
sion are much lower than those for the 
naive version. This shows the inherent 


danger in evaluating parallel algorithms 
solely in terms of simple speedup mea¬ 
surements or communication-to-calcu- 
lation ratios. The greater the 
degree of dynamic control, 
the more the overall execu¬ 
tion time may be reduced by 
increase in communica- 
overhead. Hence, the 
faster of the two algorithms 
may be chosen even if it is 
less efficient. 

With reference to the test 
images 7 in Figure 6, the sur¬ 
face-normal version correct¬ 
ly detected 7 of the 10 planes 
shown; planes 1,5, and 6 were 
missed. Each missing plane 
has similar parameters to a 
detected plane. This is a rec¬ 
ognized problem in the 
Hough approach. Normally, 
we would expect the detected 
center to lie between the re¬ 
lated patches; image sampling 
can result in one or another 
patch being favored during 
iterative focusing. 

Execution times for the 
surface-normal version for 
plane detection compare fa¬ 
vorably with other parallel 
Hough algorithms for line 
detection in 2D parameter 
space. 8 The naive version 
demonstrates that processor 
farming allows efficient dy¬ 
namic control of resources 
when there are sufficient pro¬ 
cessing tasks to keep the work¬ 
er network busy. The tech¬ 
nique is inherently load 
balanced and can have a low 
communications overhead. 
The use of a routing harness 




Figure 6. A sample range image sharing numbered planes 
for evaluating the surface-normal version of the Adaptive 
Hough Transform. 
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Figure 7. Three fundamental functional/parallel relationships: function compo¬ 
sition F (Gx) (a), function composition F ( Gx, Hy, Iz) (b), and fixed-width 
function map, map F [e„ e 2 , e 3 ] (c). 


makes it easy to add processing resourc¬ 
es to the farm. Processor farming is also 
relevant to other low- and intermedi¬ 
ate-level visual processing on MIMD 
architectures when tasks can be split 
into independent subtasks. When cou¬ 
pled with a suitable adaptive control 
strategy, farms allow rapid focusing of 
resources on the most interesting re¬ 
gions of the scene. Dynamic focusing of 
resources could also be applied to gen¬ 
eralized Hough schemes for model 
matching with more complex parame¬ 
ter spaces. 

Functional prototyping 
of parallel algorithms 

Functional languages are used widely 
to construct prototypes. While impera¬ 
tive programs can be realized more effi¬ 
ciently, programs can be developed fast¬ 
er in a functional environment. Functional 
languages have been proposed as the 
basis of parallel system design because 
they lack assignment and so are evalua¬ 
tion-order independent. All data is passed 
explicitly as function arguments, making 
dataflows and dependencies readily iden¬ 
tifiable. 

The main approach to functional par¬ 
allelism has been the exploitation of 
implicit maximal parallelism within a 
shared-memory MIMD model. While 
there have been a number of successful 
parallel implementations of functional 
languages, they are extremely ineffi¬ 
cient. Maximal parallelism is very fine 
grain, which often leads to unaccept¬ 


able communication overheads. Another 
approach is to add explicit parallel con¬ 
structs to functional languages. While 
this enables more efficient, coarser grain 
implementations, it loses the correspon¬ 
dences between program and data struc¬ 
tures through an unsatisfactory mix of 
different programming paradigms. 

We use Standard ML (SML) for pro¬ 
totype construction and transformation. 
SML is a general-purpose programming 
language with a powerful functional 
subset. It is a strongly typed, applicative 
order language that provides a variety 
of data types, higher order functions, 
and functions as values. We restrict our¬ 
selves to the functional subset and make 
substantial use of function composition 
and recursion as program constructs. 

To use functional prototypes as the 
first stage in the development of imper¬ 
ative parallel programs, high-level rela¬ 
tionships between higher order func¬ 
tions and common parallel imperative 
control strategies must be considered. 
Such relationships are regular and scal¬ 
able, and their behavior can be predict¬ 
ed. Their identification permits the com- 
bination of explicit and implicit 
parallelism. The programmer employs 
familiar functional strategies in proto¬ 
type construction. A translation system 
can generate parallel imperative code, 
guided by prototype instrumentation. 

Figure 7 shows three fundamental 
functional/parallel relationships. A func- 
tion composition may be realized 
through vertical parallelism, as seen in 
Figure 7a. A function with a number of 
functions as arguments can be imple¬ 
mented through horizontal and vertical 


parallelism as a pipeline branch, shown 
in Figure 7b. 

We often wish to apply the same func¬ 
tion to a sequence of values, for exam¬ 
ple in the form of a list in which we 
recursively apply the function to each 
element. In a function map, each appli¬ 
cation of the functional argument may 
be carried out in parallel. If the list is of 
known fixed length and that length is 
relatively small, the map may be real¬ 
ized through explicit horizontal paral¬ 
lelism, as illustrated in Figure 7c. How¬ 
ever, the length of the list may vary at 
runtime, and the horizontal branching 
structure may be so wide as to make its 
construction impractical. An alterna¬ 
tive to fixed-branch parallelism is to 
implement a function map using the 
processor farm paradigm. In Figure lc, 
each worker in the farm executes the 
mapped function. 

Given a functional prototype, we need 
to identify function groupings that cor¬ 
respond to useful parallel processor 
configurations. In general, parallelism 
is useful if the time gained by executing 
processes on independent processors 
exceeds the overhead in communicat¬ 
ing data between the processors. Static 
complexity analysis provides broad 
bounds for program and subprogram 
execution times but gives little feel for 
actual subprogram behavior. Execution 
profiling with typical and pathological 
data sets provides empirical evidence of 
subfunctional behavior. 

A functional program is an explicit 
hierarchy of subfunctions that provides 
potential groupings of processes for 
placement on processors. This initial 
hierarchy may not provide groupings 
that realize useful parallelism. Func¬ 
tional languages enable the application 
of formal transformations to programs 
to construct equivalent programs with 
different behavioral characteristics. 3 
Prototype analysis information may be 
transformed consistently, obviating the 
need to reinstrument transformed pro¬ 
totypes. Hierarchical groupings of com¬ 
positions and mappings may be realized 
directly as hierarchies of pipelines con¬ 
taining farms — and farms containing 
pipelines. Standard transformations for 
function compositions and mappings 
may then be applied to restructure hier¬ 
archies and generate groupings with im¬ 
proved execution/communication bal¬ 
ances. This balance is based on the known 
performance of the functional proto¬ 
type and the predictable performance 
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Figure 8. Correspondence between parts in the image 
plane and the model for perspective inversion. 


/V- u"+ m 2 \"- v"— 2/m u"- v"= |u- v|“ 
l 2 u" u"+ n w"- w"- 2ln u"- w"=|u-w| 
m 2 v"- v"+ n 2 w"- w"-2 mn y"- w"=|v-w|" 

Figure 9. Relations between vectors in Figure 8. 


of the final imperative code. 
Such restructuring can be per¬ 
formed safely in a functional 
program because of locality 
of data and the lack of side- 
effects. In an imperative en¬ 
vironment, the possibility of 
global variables involves the 
reanalysis of the complete 
program. 

A major source of func¬ 
tional program inefficiency 
is the need to modify data 
structures through copying 
rather than through assign¬ 
ment. This practice can re¬ 
sult in unnecessary data com¬ 
munication in a parallel 
implementation. In the par¬ 
allel local-memory model, 
data placement is an impor¬ 
tant consideration. A distinc¬ 
tion can be made between 
data that is reused during each 
process activation and data 
that is used once for each 
activation and discarded. Par¬ 
tial application 9 can be used 
to identify those arguments, 
which are called initialization 
and activation data, respec¬ 
tively. Program transforma¬ 
tion can maximize use of par¬ 
tially applied functions and 
hence minimize data copy¬ 
ing. Arguments that are 
passed to nested function calls 
without being processed can 
be replaced with direct ref¬ 
erences to global variables. 10 
In the parallel imperative for¬ 
mat, using free variables and 
global values permits data to 
be communicated directly to 
the process where it is used, 
reducing communication 
overheads. 


PIA prototyping 
and 

implementation 

The parallel perspective inversion al¬ 
gorithm (PIA) provides a method for 
deciding an object’s location in a per¬ 
spective 2D image. 11 It initially detects 
line and arc segments. These are used to 
form scene triples that consist of sets of 
three points, where a central point has 
connections by line segments to two 


other points. A hypothesized match may 
be made between such a scene triple 
and an “L” junction in the model. This 
allows inversion of the perspective trans¬ 
form. The solution depends on deriving 
an estimate for the camera position based 
on the proposed match between the 
scene and model triple, illustrated in 
Figure 8. The model triple, PQR, is 


defined in the 3D object- 
coordinate system that has its 
origin at O(object). The vec¬ 
tors u, y, and w represent the 
positions of the points P, Q, 
and R in this coordinate sys¬ 
tem. The camera is represent¬ 
ed by the image plane and 
focal length (/) and is placed 
at the origin of the coordi- 
system for the camera 
O'(camera). The model PQR 
casts an image P'Q'R' onto 
the image plane. The vectors 
u', v', and w' are the positions 
of the model points P, Q, and 
R in the camera coordinate 
system. The vectors u", v", 
and w" are the unit vectors to 
the points P', Q', and R', re¬ 
spectively. 

We use the constraint that 
the lengths of the objects’ sides 
must remain constant, no mat¬ 
ter what coordinate system is 
state the relations 
between the vectors that must 
hold for certain values of /, m, 
and n, the lengths of the u', v', 
and w' vectors as shown in 
Figure 9. 

If the focal length and im¬ 
age size of the camera are 
known, scalar products can 
be found from information in 
the model and image, leaving 
/, m, and n as the three un¬ 
knowns. The length and di¬ 
rection information specifies 
the position of the model (in 
the camera coordinate sys¬ 
tem) that would cast the shad¬ 
ow P'Q’R' on the image plane. 
The solutions to this set of 
simultaneous equations can 
be found directly, giving up 
to four possible positions for 
the camera in the model co¬ 
ordinate system. 

The matching procedure 
selects data triples from the 
model and scene, and inverts 
the perspective transform 
from the hypothesized match. The cal¬ 
culated parameters for the camera posi¬ 
tion can be applied to the model to 
produce estimated positions for features 
in the scene that are compared to the 
actual scene. The match between the 
estimated model features and scene fea¬ 
tures may now be “scored.” Around 
each projected point, a nearest neigh- 
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Create scene triples — initialization 

Create model triples 

FOR all triple-triple pairs DO — generate triples (calculate all) 

Calculate transformation — calculate perspective transform (RT_calc) 
Back-project model to scene 

Count feature matches — score hypothesized match (score) 


Figure 10. Algorithm A3, the perspective inversion algorithm (PIA). 



(a) 




(b) 


Figure 11. Three parallel im¬ 
plementations: a processor 
farm containing a pipeline 
(a); a simple pipeline (b); and 
a pipeline of farms (c). 


fun calculate.all_[ ] model.triples = [ ] 

fun calculate.all model.data scene.data (scene.triple::Rest) model.triples = 
let val part_result = 
map 

( (score model.data scene.data) o (RT_calc scene.triple)) 
model.triples 
in 

part_result:: (calculate.all model.data scene.data Rest model.triples) 
end; 


Figure 12. SML for the top level of functional prototype. 


bor search locates the image feature 
points that match in type and position 
the projected model point. The observed 
match’s quality assessment uses the 
number of “close” (within 10 pixels) 
matches together with the total discrep¬ 
ancy in observed positions. 

Figure 10 illustrates how the top level 
of the PIA may be expressed. 

From an initial examination of algo¬ 
rithm A3, we can identify the two possi¬ 
ble parallel implementations shown in 
Figures 11a and lib. The first is to un¬ 
wind the FOR loop, leading to a process 
farm in which the farm workers are a 
simple pipeline of score and RT.calc 
processes, as shown in Figure 11a. The 
efficiency of this single-farm configura¬ 
tion has a linear dependence on the 
number of workers employed in the 
farm. The second implementation is to 
ignore the horizontal (farm) parallelism 
and form a single pipeline of RT.calc and 
score processes, as shown in Figure lib. 

We use algorithm A3 to guide the 
coding of our first functional prototype. 
Ignoring initialization, we obtain the 
SML shown in Figure 12. 

The imperative Occam version of this 
function, produced by translation, is 
directly comparable to the first imple¬ 
mentation shown in Figure 11. 

Construction of this functional pro¬ 
totype allows dynamic analysis of the 
score and RT.calc functions summarized 
in Table 1. The models contain between 
8 and 26 features that group to form 
between 16 and 144 triples. The scene 
data consists of single nonoccluded com¬ 
ponents, although the approach deals 
readily with occlusions. 11 Each scene 
contains between 19 and 48 features 
that group to form between 4 and 16 
triples. In cases 1,2, and 3, the model is 
the same as the component appearing 
in the scene. In case 4, the model is the 
same as that of case 3, but the scene is 
taken from case 1. Case 5 is the reverse 
of case 4. Figure 13 illustrates the su¬ 
perimposition of the relevant model for 
case 3. 

Prototype analysis suggests that the 
pipeline version of the A3 algorithm 
would be poorly balanced, since the 
RT.calc process executes only 20 per¬ 
cent of the time for the score process. 
The RT.calc process would quickly 
become blocked because it generates 
results faster than the score process con¬ 
sumes them. The farm-based implemen¬ 
tation appears to be the best choice for 
implementing the PIA. 
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Alternatively, the following symmet¬ 
ric transformation relationship holds for 
all map and compositions of any length: 

map (f o g) xs <=> ( (map f) o 
(map g)) xs 

and for portions of longer compositions: 

map (f o g o h) xs => ( (map f) o 
(map (g o h))) xs 

Thus, in the calculate.all function we 
can rewrite the mapping of the compo¬ 
sition of score and RT.calc as a compo¬ 
sition of two maps: 

let val part_result = ((map score 
model.data scene.data ) o (map 
(RT.calc scene.triple)) ) 
model.triples 


Table 1. Runtime profiles of ML functions for RT.calc and score. 


Case 

Function 

No. Calls 

Percentage of 

Milliseconds/call Execution Time 

1 

RT.calc 

264 

4.07 

15.83 


score 

404 

13.66 

81.29 

2 

RT.calc 

576 

4.70 

14.21 


score 

1,172 

13.49 

82.90 

3 

RT.calc 

256 

4.47 

14.54 


score 

480 

13.53 

82.52 

4 

RT.calc 

864 

4.06 

20.25 


score 

1,254 

10.61 

76.91 

5 

RT.calc 

176 

4.69 

16.51 


score 

358 

11.26 

80.68 


The effect of direct translation is 
shown as the pipeline of farms in Figure 
11c. Such a configuration cannot easily 
be achieved by direct examination of 
A3. In the previous farm configuration 
in Figure 11a, the execution time relat¬ 
ed directly to the number of workers in 
the farm; we could only increase or de¬ 
crease the number of farm workers to 
affect processing times. Now we can 
alter the ratio of workers between the 
two farms to balance the pipeline of 
farm processes. Dynamic analysis shows 
that the execution times for RT.calc 
and score are in the ratio 1:5 for typical 
models and scenes. Hence we ensure 
that a 1:5 ratio is maintained between 
the farm sizes. 


Examination of the functional proto¬ 
types shows that the data sets model.data 
and scene.data form initialization data 
for the score function. They may be 
removed as dynamic arguments through 
partial application. Table 2 shows a 
marked reduction in dataflow in the 
new prototype 
compared with 
the naive ap¬ 
proach without 
partial applica- 


prototype as a pipeline of two farms. 
Figure 14 on the next page shows the 
execution time and speedup for case 3 
for the single farm, demonstrating near- 
linear speedup. Figure 15a shows the 
execution times and Figure 15b the 
speedup graphs for the pipeline of two 


Table 2. Comparison of data-transfer quantities. 


We have imple¬ 
mented the origi¬ 
nal prototype as a 
single farm and 
the transformed 



Figure 13. Superimposition of the relevant model for 
case 3. 



Single Call 

All Calls 

Case Integers 

Real Numbers 

Integers 

Real Numbers 

Transferred in naive PIA 



1 

76 

126 

25,992 

43,092 

2 

98 

159 

68,208 

110,664 

3 

112 

180 

51,968 

83,520 

4 

92 

150 

41,584 

67,800 

5 

74 

123 

20*720 

34,440 

Transferred in partial application 


1 

76 

126 

76 

4,218 

2 

98 

159 

98 

8,499 

3 

112 

180 

112 

5,736 

4 

92 

150 

92 

5,562 

5 

74 

123 

74 

3,471 
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Figure 14. Execution time and speedup for c: 
single farm. 


farms, for both one and two workers in 
the RT calculation farm. Near-linear 
speedup is achieved by maintaining the 
ratio of farm sizes at 1:6 or more. Below 
this ratio, no further speedup is achieved 
when more score workers are added. 
This corresponds well to the predicted 
1:5 ratio from prototype instrumenta¬ 
tion. 

W e developed new parallel im¬ 
plementations of visual pro¬ 
cessing algorithms for seg¬ 
mentation and model-based interpreta¬ 
tion of visual scenes from existing se¬ 
quential implementations. In particular, 
we have examined the capability for 
dynamic control of algorithms on the 
basis of partial results (for example, on 
examination of progressively finer res¬ 
olutions in Hough space for plane de¬ 
tection, or within the search space of 
possible solutions based on matching 
triples in the PIA). This capability is a 
prime requisite for the intelligent ex¬ 
ploitation of MIMD architectures for 


higher level vi¬ 
sion, which must 
be treated in a 
quite different 
manner from low- 
level processing of 
pixel data. 

We demonstrat¬ 
ed that functional 
prototypes have 
desirable proper¬ 
ties for parallel- 
program develop¬ 
ment including 
their sound math¬ 
ematical basis, 
lack of explicit 
time ordering, and 
ease of implementation. Dynamic anal¬ 
ysis of a functional prototype of the PIA 
allowed rapid identification of the rela¬ 
tive complexity of the component pro¬ 
cesses on typical image data. Using 
general techniques for program trans¬ 
formation, we have encoded an imper¬ 
ative parallel version of the PIA that 
compares favorably with that produced 
by direct implementation. Transforma¬ 
tion also enabled the construction of 
consistent versions of the algorithm with 
different behavioral characteristics from 
the initial specification. However, SML’s 
strict evaluation order makes it more 
difficult to prototype time-dependent 
actions, like the opportunistic control 
of an algorithm during execution. 

We suggest that future exploitation 
of parallelism in computer vision sys¬ 
tems may be achieved not only within 
specific algorithms such as the AHT 
and PIA demonstrated here but also 
between different algorithms within a 
complete visual process architecture. 
Here, we hope to include both high- and 


low-level processes, and both shape- 
and surface-based processes (such as 
texture and color), within the same over¬ 
all control structure. ■ 
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Mapping Computer- 
Vision-Related Tasks onto 
Reconfigurable Parallel- 
Processing Systems 


Howard Jay Siegel, James B. Armstrong, and Daniel W. Watson 
Purdue University 


The authors 
demonstrate how 
reconfigurability can be 
used by reviewing and 
examining five 
computer-vision- 
related algorithms. 
Each one emphasizes a 
different aspect of 
reconfigurability. 


T he “need for speed” has been the single most influential factor in super¬ 
computer design. In the past, technology fueled the development of faster 
computers through better semiconductor devices and very large scale 
integration (VLSI). Technology, as a source of speed for a single processor, is 
bounded by the speed of light and physical limitations on miniaturization. Conse¬ 
quently, it has become necessary to replicate hardware to allow concurrent 
execution to achieve the performance requirements of many of today’s scientific 
and industrial applications. This concurrent execution, or parallel processing, has 
forced the reformulation of the most well-accepted sequential programs and even 
the mathematical rethinking of some problems. The parallel programmer needs to 
“think parallel.” 

Many parallel-processing systems of different sizes and configurations have 
been developed (see the “Models of parallelism” sidebar). The feasibility of 
systems with thousands of processors has become evident with the introduction of 
several types of massively parallel systems. As the size, hardware complexity, and 
programming diversity of parallel systems continue to evolve, the range of alter¬ 
natives for implementing a parallel task on these systems grows. Choosing the 
proper parallel algorithm and implementation becomes an important decision and 
has a significant impact on the performance of the application (see the “SIMD 
versus MIMD” sidebar). This article is a tutorial overview of how selected 
computer-vision-related algorithms can be mapped onto reconfigurable parallel¬ 
processing systems. 

The reconfigurable parallel-processing system assumed for the discussions here 
is a multiprocessor system capable of mixed-mode parallelism; that is, it can 
operate in either the SIMD (single instruction, multiple data) or MIMD (multiple 
instruction, multiple data) mode of parallelism (see the sidebars) and can dynam¬ 
ically switch between modes at instruction-level granularity with generally negli¬ 
gible overhead. In addition, it can be partitioned into independent or communicat¬ 
ing submachines, each having the same characteristics as the original machine. 
Furthermore, this reconfigurable system model uses a flexible multistage cube 
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interconnection network, 1 which allows 
the connection patterns among the pro¬ 
cessors to be varied. 

Thus, the system is reconfigurable 
along three dimensions: 

• mode of parallelism (SIMD/MIMD), 

• partitionability, and 

• interprocessor connectivity. 


Designed at Purdue University, the 
j PASM (partitionable SIMD/MIMD) 

I parallel-processing system is one such 

I machine, and its 30-processor small-scale 

I prototype (16 processors in its compu- 

l tational engine) is supporting active 

i experimentation. 2 Other machines ca- 

[ pable of some form of mixed-mode op- 

| eration include TRAC (Texas Recon- 

[ figurable Array Computer) 3 and Opsila. 4 

The main goal here is to demonstrate 
how reconfigurability can be used by 
reviewing and examining five comput- 
I er-vision-related algorithms. Each al- 

I gorithm has been chosen to make a 

I different point: 


partitioning a system for subtask paral¬ 
lelism can improve performance. 

• The recursive-doubling algorithm, 
used in computer-vision tasks to com¬ 
pute global minimums, maximums, etc., 
demonstrates that employing more pro¬ 
cessors for a task can increase execu¬ 
tion time; this is another reason for par¬ 
titioning a system. 

•The global-histogramming algo¬ 
rithm, used to compute global histo¬ 
grams of the pixel values in an image 
and study the gray-level intensity distri¬ 
bution, typifies the challenges of auto¬ 
matic parallelization of “dusty deck” 
serial algorithms. 

• The 2D discrete Fourier transform 
algorithm, used to study the spatial spec¬ 
tral characteristics of an image, empha¬ 
sizes the importance of a flexible inter¬ 
connection network. This 2D DFT 
algorithm is presented to show network 
requirements that are distinct from the 
previous algorithms. 

• The bitonic sorting algorithm, used 
to sort sequences (for example, collec¬ 
tions of objects in an image), was imple¬ 
mented on the PASM prototype in dif¬ 


ferent ways. Experiments to compare 
modes of parallelism are demonstrated. 

The mapping of each algorithm onto 
an SIMD versus MIMD versus mixed¬ 
mode parallel system is discussed. Al¬ 
though the PASM design, which can 
support 1,024 processors, is the target 
architecture for each algorithm imple¬ 
mentation, the parallelization strategies 
presented also can be adapted for other 
systems. 

Algorithm case studies 

Image smoothing. These computa¬ 
tions are representative of those per¬ 
formed in a wide range of window-based 
image processing algorithms. An image 
is stored in memory as a two-dimen¬ 
sional array (matrix) where each ele¬ 
ment, called a picture element, or pixel, 
is an integer whose value represents the 
gray-level intensity of the correspond¬ 
ing point in the discretized image. To 
generate an MxM smoothed image A' 
from an MxM image A, the average of 
the value of pixel (i, ;') of the original 


• The image-smoothing algorithm, 
used for noise reduction, shows how 


Models of parallelism 


SIMD machines 


MIMD machine 




• Single instruction-stream, multiple data-stream 

• PE — processor/memory pair 

• Control unit broadcasts instructions to processors 

• All active PEs execute same instruction 
synchronously in lockstep on own data 

• Single control thread, single program 

• Examples: AMT DAP, CLIP-4, CM-2, MasPar 
MP-1, MPP 


Multiple instruction-stream, multiple data-stream 

PE — processor/memory pair 

Each PE has its own instructions 

PEs execute local programs asynchronously on local 

data 

Multiple threads of control, different programs 
Examples: BBN Butterfly, Cedar, CM-5, IBM RP3, 
Intel Cube, Ncube, NYU Ultracomputer 


For further reading 

Almasi, G.S., and A. Gottlieb, Highly Parallel Computing, Benjamin/Cummings, Redwood City, Calif., 1989. 
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image and that of its eight nearest neigh¬ 
bors is computed and forms pixel (/,/) of 
the smoothed image A': 

A'(i,j) = [A(i-l,j-l) + 

A(i, /- 1) + A(i + I,/— 1) + 
A(i-l,j)+A(i,j) + 
A(i+l,j)+A(i~l,j + l) + 

A(i, j + \) + A(i + l, j + 1)] / 9 


In the case of an edge pixel, no calcu¬ 
lation is performed, and the pixel itself 
is taken to be the smoothed value. 
Because there are 4M - 4 edge pixels in 
anMxM image, the time to smooth an 
M x M image A on a serial machine is 
the time to execute M 2 - (4 M - 4) = 
0(M Z ) smoothing operations. For M = 
512, this is 260,100 smoothing opera¬ 


tions, approximately equal to M 2 - 
262,144. 

Because smoothing involves perform¬ 
ing the same operations for every pixel, 
very efficient SIMD implementations 
are possible. 5 Assume that there are N 
PEs (processing elements — processor/ 
memory pairs) available, logically ar¬ 
ranged as a V/V x V/V grid, and each PE 


SIMD versus MIMD 


SIMD advantages 

Ease of programming and debugging 

• SIMD: Single program, PEs operate synchronously. 

• MIMD: Multiple interacting programs, PEs operate 
asynchronously 

Overlap loop control with operations 

• SIMD: Control unit does increment and compare, 
while PEs “compute” 

• MIMD: Same PE does both 


MIMD advantages 

More flexible 

• No constraints on operations that can be performed 
concurrently 

Conditional statements more efficient 

• MIMD: Each PE executes as if uniprocessor 

• SIMD: “Then” and “else” execution serialized 

No SIMD control unit cost 


Overlap operations on common data 

• SIMD: Control unit overlaps operations that all PEs 
need (for example, common local array addresses) 

• MIMD: Same PE does all 

Reduced inter-PE transfer overhead 

• SIMD: “Send” and “receive” automatically 
synchronized 

• MIMD: Need explicit synchronization and 
identification protocol 

Minimal synchronization overhead 

• SIMD: Implicit in program 

• MIMD: Need explicit statements (for example, 
semaphores) 

Less program memory space required 

• SIMD: Store one copy of program 

• MIMD: Each PE stores own copy 

Minimal instruction decoder cost 

• SIMD: Decoder in control unit 

• MIMD: Decoder in each PE 


For further reading 

Berg, T.B., and H.J. Siegel, “Instruction Execution Trade-offs for SIMD 
versus MIMD versus Mixed-Mode Parallelism,” Proc. Fifth Int’lParallel 
Processing Symp., IEEE CS Press, Los Alamitos, Calif., Order No. 2167, 
1991, pp. 301-308. 

Jamieson, L.H., “Characterizing Parallel Algorithms,” in Characteris¬ 
tics of Parallel Algorithms, L.H. Jamieson, D.B. Gannon, and R.J. 
Douglass, eds., MIT Press, Cambridge, Mass., 1987, pp. 65-100. 


Variable-time instructions more efficient 

• Assume there is a block of instructions where the 
execution time of each instruction is data dependent 

• SIMD: Waits for slowest PE to execute each 
instruction (“sum of max’s”) 

T S!MU =£ max (instr. time) 

• MIMD: Waits for slowest PE to execute block of 
instructions (“max of sums”) 

T’mimd = max (instr. time) 

• Example: Execution of three instructions in SIMD 
mode and MIMD mode 
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Figure 1. Data allocation (a) and pixel transfers (b) for image smoothing. 


stores an MNN x MNN subimage (see 
Figure 1). Each PE performs at most 
A PIN smoothing operations. 

To smooth the pixels at the edge of a 
subimage, pixels from logically adja¬ 
cent subimages must be transferred (Fig¬ 
ure 1). Therefore, each PE requires at 
most M Nn pixels from each of the four 
adjacent PEs and one pixel from each of 
the four PEs diagonally adjacent to the 
PE. Thus, a worst-case total of 4(MNN) 
+ 4 inter-PE data transfers are required 
to perform the smoothing, where N pix¬ 
els are moved by each transfer. The 
inter-PE transfers needed for this algo¬ 
rithm can be done efficiently on 2D 
mesh networks, hypercube (single-stage 
cube) networks with embedded mesh¬ 
es, 6 and multistage cube networks. 1 

The execution time of the above algo¬ 
rithm when operating on an M x M 
image A with N PEs is the sum of A PIN 
smoothing operations and 4 (MNN) + 4 
inter-PE data transfers. Thus, for M = 
512 and N = 1,024, there are 256 smooth¬ 
ing operations and 68 inter-PE data 
transfers required. If the time to per¬ 
form one inter-PE transfer is equal to 
the time to perform one smoothing op¬ 
eration, the speedup S of the SIMD 
version over that of the uniprocessor 
algorithm is 

^ _ serial time 

parallel time 

[m - if 

M 2 IN + 4MIs[n +4 
This speedup calculation is based on 
smoothing and inter-PE transfer opera¬ 
tions (ignoring, for example, loop index 
variable manipulations) and 
the assumption that the uni¬ 
processor and each SIMD 
PE are of equivalent com¬ 
puting power. Theoretical¬ 
ly, the maximum possible 
speedup is N. For M = 512 
and N = 1,024, the speedup 
is 510 2 /324 s 803. However, 
if the time to perform a net¬ 
work transfer becomes much 
less than the time to per¬ 
form a smoothing operation 
—which is normally the case 
in SIMD mode—the speed¬ 
up is closer to N. The speed¬ 
up is not N even if communi- 
cation time is ignored, 
because the PEs containing 
image-edge pixels will be dis¬ 
abled for some smoothing 


operations and are therefore underuti¬ 
lized for some steps of the algorithm. 
This example demonstrates two condi¬ 
tions, inter-PE data transfers and dis¬ 
abling of PEs for some operations, which 
cause SIMD algorithms to execute with 
less than perfect speedup (that is, S < 
N). 

The smoothing algorithm can also be 
implemented in MIMD mode. Howev¬ 
er, the discussion of SIMD/MIMD trade¬ 
offs (see the “SIMD versus MIMD” 
sidebar) explains why there is little rea¬ 
son to prefer this mode. One MIMD 
implementation advantage would be 
manifest if the “divide-by-9” operation 
is data dependent, invoking the “sum of 
max’s” trade-off. However, because of 


the potential SIMD benefits of CU (con¬ 
trol unit)/PE overlap and implicitly syn¬ 
chronized transfers, SIMD mode would 
probably be best. 

Recursive doubling. The recursive¬ 
doubling procedure, 7 sometimes called 
tree summing, is a combining algorithm 
that can be used to apply any associa¬ 
tive operation (for example, min, max, 
sum, product) to a set of operands. Con¬ 
sider the task of finding the sum of N = 
1,024 numbers, for example, Z/4(i), 0 < 
i < 1,024. The following algorithm can 
perform this task on a serial machine: 

sum = A(0) 
for / = 1 to 1023 do 
sum = sum + A(i) 

One addition is performed 
per iteration for a total of 
1,023 = TV additions. 

Although this task appears 
to be sequential in nature, 
summing N numbers with N 
PEs by this procedure re¬ 
quires' only log 2 iV transfer- 
add steps, where a transfer- 
add is composed of the 
transfer of a partial sum to a 
PE and the addition of that 
partial sum to the PE’s local 
sum. This is demonstrated 
for N = 8 in Figure 2. Let T add 
be the time required to exe¬ 
cute an addition, and T tramfer . add 
be the time to execute a trans- 
fer-add. Then, the speedup 
of this algorithm is 
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The recursive-doubling technique can 
be extended to operate on a set of oper¬ 
ands whose size is greater than the num¬ 
ber of PEs. Let M be the number of 
operands and let /V = 2" be the number 
of PEs, numbered from 0 to N- 1, where 
PE P’s number in binary isp„_,... pj) 0 . 
Let each PE store l_M/jVj of the oper¬ 
ands, and let M mod N PEs receive one 
additional operand of the remaining M 
mod N operands. Each PE first oper¬ 
ates sequentially on its local data, re¬ 
quiring at most T MIN] operations. Once 
the local results have been obtained, 
log 2 iV transfer-ops are made. In trans¬ 
fer-op/, where j proceeds in time from 0 
to (log 2 /V) - 1, PEP =/?„_, ...p j+l 10 ... 0 
(that is, pj ... pj} 0 are fixed at 10 ... 0), 
passes its results to PE F =p„_,... p j+ i 00 
... 0 (that is,pj ...pj> 0 are fixed at 0... 0). 
PE P' uses the received result and the 
previously stored partial result to com¬ 
pute a new partial result. After log 2 /V 
transfer-ops, PE 0 will contain the glo¬ 
bal result. This is the sequence of trans¬ 
fer-adds used in Figure 2 for M = N = 8. 
The inter-PE transfers needed for this 
algorithm can be done efficiently in hy¬ 
percube and multistage cube networks. 1 
Mesh-based systems require additional 
hardware (as in the Massively Parallel 
Processor, or MPP, machine 8 ) to do this 
efficiently. 

As an example, let M = 16K, N = 128, 
and let each PE get MIN= 128 numbers. 
First, each PE sums its 128 numbers (1 
load and 127 adds), requiring approxi¬ 
mately the same amount of time as 128 
additions. Then, log 2 /V = 7 transfer-add 
steps are needed to combine the 128 
local sums into one global sum in PE 0. 
The total time is (128 x T add ) + (7 x 
^transfer-add) > compared to the serial time 
of 16K x 7’ add . If T transfer . add = x x P add , the 
speedup of this algorithm is 

y. 16KxP add 

(l28xP add ) + (7xxxT add ) 

16K _ M 

128 + (7 x x) Af//V + xxlog 2 /V 

There are advantages to implement¬ 
ing the recursive-doubling algorithm in 
SIMD mode, MIMD mode, and mixed¬ 
mode. The mode in which the local sums 
are computed depends on the type and 


number of operations performed, as well 
as on the machine implementation of 
the operation. For example, the addi¬ 
tion of M numbers, where M = pN and 
p is an integer greater than zero, re¬ 
quires one load and p - 1 additions to 
compute the local sums. These addi¬ 
tions could be normalized floating-point 
additions and therefore may take vari¬ 
able time to execute. Let Tf represent 
the time to perform addition i on PE P 
and let /Vequal the number of PEs used. 
Then, the time to perform p - 1 addi¬ 
tions in MIMD mode is 



and in SIMD mode is 


(see the “SIMD versus MIMD” side- 
bar). Because T MIMD < T SIMD , the time to 
compute the local sums in MIMD would 
be less than or equal to the time to 
compute them in SIMD mode. Howev¬ 
er, the addition instruction would most 
likely be contained within a loop. The 
loop control instructions can be execut¬ 
ed in the CU in SIMD mode but must be 
executed on the PEs in MIMD mode. 
Whether the advantage of CU/PE over¬ 
lap in SIMD mode outweighs the MIMD 
advantage in executing variable-time 
instructions is dependent on the ma¬ 
chine implementation of a floating-point 
addition as well as the actual data used. 

The above discussion also applies to 
the min/max operations. The computa¬ 
tions involve a data-conditional state¬ 
ment of the form 

if (number > max) then 
max = number 

This conditional statement can be re¬ 
garded as a variable-time instruction. 
In this case, the time to perform p - 1 
comparisons in SIMD mode and MIMD 
mode would be given by the respective 
equations above: Tf now denotes the 
time to perform comparison i on PE P. 
The above trade-offs between the two 
modes would apply, but there is a great¬ 
er variability in the execution time of 
the data conditional, which would, in 
general, strongly favor an MIMD im¬ 
plementation. When p = 2, only one 
operation is performed to compute the 
local result. Substituting p = 2 into the 


equations above yields the theoretical 
result T SIMD = T MIMD . Thus, the preferred 
mode of execution of one addition or 
max operation depends on the machine 
implementation details (for example, in¬ 
struction fetch time). 

Consider the process of combining 
the local sums. There are log 2 /V transfer- 
ops, where each transfer step is separat¬ 
ed by a single operation. As mentioned 
above, this single-operation performance 
for SIMD and MIMD modes is virtually 
equal. Therefore, the combining pro¬ 
cess can best be performed in SIMD 
mode because there is less transfer over¬ 
head and potential for CU/PE overlap. 
Furthermore, the transfers involved in 
the recursive-doubling algorithm must 
be executed in a constrained order, which 
forces the PEs to synchronize between 
transfers. Hence, there is no advantage 
to executing any part of the combining 
process in MIMD mode. 

The best implementation of the whole 
recursive-doubling algorithm must con¬ 
sider both the local calculations and the 
inter-PE combining phases. The pre¬ 
ferred approach for the whole algorithm 
would be either all SIMD or mixed¬ 
mode, depending on whether SIMD or 
MIMD, respectively, is optimal for the 
local phase. 

Global histogramming. Let anMxM 
input image be mapped onto N PEs such 
that each PE holds M 2 /N pixels, as in the 
image-smoothing discussion (see Fig¬ 
ure 1). Global histogramming involves 
computing B bins, where each bin has 
two attributes associated with it: the 
range of values that each bin represents 
and the number of pixels in the entire 
image that have values within that range. 
For this algorithm, 5 it is assumed that N 
is an integer multiple of B, and there is 
one bin for each possible pixel value. 

Each PE first computes a local B-bin 
histogram for the AfV/Vpixels in its mem¬ 
ory. Let A(x, y ) be the gray-level value 
of the pixel in row x and row y, and let 
bin(i) be initialized to 0,0 < i < B . If each 
PE contains an ( MNn) x ( MNN) sub¬ 
image, then an algorithm to compute 
the local fl-bin histograms could be 

for x = 0 to (MNN) - 1 
for y = 0 to (MNN) - 1 
bin(A(x, y)) = bin(A(x, y)) + 1 

(for the serial algorithm, set N = 1). The 
PEs then combine their local histograms 
with the local histograms of all other 
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Figure 3. Illustration of the global-histogramming algorithm 
for N = 16 and B = 4. 


PEs. The straightforward 
approach to combining the 
local histograms is to com¬ 
bine one bin at a time using 
recursive doubling, requir¬ 
ing Blog 2 /V transfer-add 
steps. 

Consider an overlapped 
recursive-doubling proce¬ 
dure for combining local 
histograms, where all of the 
bins are summed concur¬ 
rently (see Figure 3). In the 
figure, (w, ..., z) denotes 
that bins w,..., z are accu¬ 
mulated in that PE. The N 
PEs are logically divided 
into NIB blocks of B PEs. 

In the first b = log 2 B stages, 
the NIB blocks simulta¬ 
neously combine their his¬ 
tograms. Each PE in a block 
holds a different bin com¬ 
puted by summing the val¬ 
ues of the corresponding 
local bins of the PEs in the 
block. This is done by di¬ 
viding each block of PEs in half such 
that the PEs with lower addresses form 
one group and the PEs with the higher 
addresses form another group. Each 
group accumulates the sums for half the 
bins and sends the bins it is not accumu¬ 
lating to the other group. 

Figure 3 shows this process for N = 16 
and B = 4. For example, in stage 0, PE 0 
accumulates bins 0 and 1 from PE 0 and 
PE 2. Simultaneously, PE 1 accumu¬ 
lates bins 0 and 1 from PE 1 and PE 3. 
The next stage involves dividing each 
group of Bl2 PEs into two different 
groups of 6/4 PEs formed once again by 
PEs with lower addresses and higher 
addresses. These two new groups ex¬ 
change bins the same way as before, but 
only for those bins for which they had 
accumulated sums in the last stage. For 
example, in stage 1, PE 0 accumulates 
bin 0 from PE 0 and PE 1. The subdivid¬ 
ing process continues until there is one 
PE in each group and the sums for each 
bin have been completely accumulated 
for the portion of the image contained 
in that block — each bin in a different 
PE. 

The next n - b stages, where n = 
log 2 iV, combine the partial histograms 
of all the blocks by performing B simul¬ 
taneous recursive-doubling operations 
(Figure 3). Each of these 5 recursive¬ 
doubling operations involves those NIB 
PEs that store the sums for the same bin 


index. As a result, the histogram for the 
entire image is distributed over 5 PEs 
where bin i is located in PE i for 0 < i < 
B. 

At each stage j, for 0 < j < b, BI2 i+i 
transfer-adds take place. The total num¬ 
ber of transfer-adds for the first b stages 
is 

£(6/2' +1 )=6-l 

For each stage j where b < j < n, one 
transfer-add occurs. The final n-b stag¬ 
es require log(/V/6) = n - b transfer- 
adds. The total number of transfer-adds 
needed to merge the local histograms 
using the overlapped recursive-doubling 
scheme is then 6 - 1 + log 2 (/V/5). 

In the case of 6 = 256, N = 1,024, and 
M = 512, the serial histogramming algo¬ 
rithm would require approximately M 1 
= 256K additions. The SIMD algorithm 
requires A PIN = 256 additions to com¬ 
pute the local histograms. If the straight¬ 
forward recursive-doubling algorithm 
is used to combine the local histograms, 
61og 2 /V = 2,560 transfer-adds are neces¬ 
sary. By comparison, the non-obvious 
overlapped recursive-doubling algo¬ 
rithm requires B - 1 + log 2 (A76) = 257 
transfer-adds. The merging of the local 
histograms using the non-obvious meth¬ 
od yields nearly a factor of 10 (approx¬ 
imately log 2 /V) speedup over its obvious 
counterpart. 


The inter-PE transfers 
needed for the global-his¬ 
togramming algorithm can 
be done efficiently in hy¬ 
percube networks and mul¬ 
tistage cube networks. 
While some mesh-based 
systems have the extra 
hardware to perform re¬ 
cursive doubling, the simul¬ 
taneous recursive dou¬ 
blings used here are not 
efficiently implementable. 

Global histogramming 
involves integer additions, 
so the time to compute lo¬ 
cal histograms is not data 
dependent and could be 
performed fastest in SIMD 
mode because of the po¬ 
tential for CU/PE overlap. 
This assumes that all PEs 
can simultaneously access 
local bin(/4(jt,y)) locations 
and that the actual PE 
memory locations ad¬ 
dressed may differ among 
PE processors. For the combining pro¬ 
cess, the large number of inter-PE trans¬ 
fers and the potential for CU/PE over¬ 
lap makes SIMD mode the obvious 
choice for this phase. No data-depen- 
dent conditionals are involved because 
indexed addressing can be used by each 
PE to determine which bin is trans¬ 
ferred at each stage. Therefore, SIMD 
mode is most appropriate for global 
histogramming. 

It is clear from this example that while 
several parallel algorithms may have 
increased performance over the serial 
algorithm, an optimal mapping of a task 
onto a parallel machine is often a subtle 
method that is derived from a compar¬ 
ison of many alternatives. The non-in- 
tuitive structure of this algorithm also 
demonstrates the challenges in design¬ 
ing compilers that automatically con¬ 
vert “dusty deck” serial code into fast 
parallel algorithms. While reconfigu- 
rable systems may provide the possibil¬ 
ity of increased performance, the great¬ 
er flexibility in configuring these systems 
adds to the difficulty of producing com¬ 
piler-generated parallel code that makes 
optimal use of the architecture. 

2D discrete Fourier transform. A 2D 

DFT of an M x M image can be con¬ 
structed by first taking the ID DFT of 
each row and then taking the ID DFT of 
each column of the resulting M x M 
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for k = 1 step + 1 until log 2 TV do 
for i-k-t step - 1 until 0 do 
for q = 1 step + 1 until MIN do 
load X[q ] into network 
send to PE whose number differs in bit i 
Y[q\ <r- network output 
merge(A, Y) 
swap(A', Y) 


Figure 5. Bitonic sequence-sorting algorithm. 


r transfer = r comp ,„ 

puted as 



problem size for TV = 16 PEs. 


array (see Figure 4). Consid¬ 
er an SIMD parallel imple¬ 
mentation of this approach 9 
with TV = M PEs, numbered 0 
to M - 1. Assume that PE i 
contains a variable PE# = i; 
that is, each PE “knows” its 
own number. Original image 
elements IQi, 0)..., I(h, M- 1) 

(that is, row h) are initially 
stored in PE h, where 0 <h< 

M. Each PE then performs a 
ID A/-point DFT on the row 
stored in its memory. The ID 
FFT (fast Fourier transform) 
algorithm is used to compute 
the ID DFT in each PE. The 
result is the M x M array G. 

PE h has created row h of G, 
where [ G(h , 0) ..., G(h, M - 
1)] = DFT([/(/z, 0)..., I(h,M- 
!)])• 

The DFTs of the columns 
of G must now be computed. 
However, each element of 
column w,0<w<M, [G(0, w ) 

..., G(M -1, w)] is located in 
a different PE; that is, G(h , 
w) is stored in PE h. One way 
to perform a DFT on the col¬ 
umns of G is to move each 
element of the same column 
to the same PE, that is, so 
that PE ; holds column j of G. 

The new arrangement of G is 
equivalent to the transpose 
of G. Then, each PE can com¬ 
pute the DFT (via the FFT 
algorithm) of the column of 
G in its memory to obtain F T , 
the transpose of the DFT of I. 

PE h calculates column h of 
F, where [F(0,h) ...,F(M-l, 
fc)]=DFT([G(0,/i) ...,G(M- 

1. A>3- 

Taking the transpose of G 
requires moving element 
G{h, w) (element w in PE h) 
to location G T ( w , h) (ele¬ 
ment h in PE w). This can be 
done in M - 1 inter-PE trans¬ 
fers. For transfer i, 1 < z < TV/, 

PE h fetches element G(h, h 
+ i mod M) from its memory 
and sends it to PE h + i mod 
M, which stores it as G T ( h + 
i mod M, h). PE h fetches 
G(h, h + i mod M) from its memory 
location &G + ((PE# + z) mod M), where 
&G is the address of G(PE#, 0). PE h + 
i mod M stores the received element in 
its memory location &G T + ((PE# - i ) 


mod M), where &G T is the address of 
G 7 (PE#, 0). The transfer used is “+ i 
mod M." Finally, the diagonal is moved 
from &G + PE# to &G T + PE# within 
each PE. 


The serial time needed to 
calculate a 2D DFT of an M x 
M image is dominated by the 
M 2 log 2 M complex multiplica- 
((M/2)log 2 M for each 
of 2 M ID M -point FFTs). 
For M = 1,024, a total of 
10,485,760 complex multipli¬ 
cations are required. In the 
SIMD approach, M FFTs are 
done simultaneously to trans¬ 
form / to G, with each PE 
performing (TW/2)log 2 M com¬ 
plex multiplications. G is then 
transposed by executing M - 
1 inter-PE data transfers. Fi¬ 
nally, (MI2)\og 2 M complex 
multiplications are needed by 
each PE to transform G T to 
F T . (Depending on the appli¬ 
cation, F^ may be used in 
place of F.) Therefore, the 
parallel time is dominated by 
the Mlog 2 M complex multi¬ 
plications and M - 1 trans¬ 
fers. A multiplicative factor 
of M is achieved for the speed¬ 
up on complex multiplica¬ 
tions, at the cost of an addi¬ 
tive term of M - 1 inter-PE 
data transfers. For the above 
example with M = 1,024, the 
SIMD algorithm would re¬ 
quire 10,240 complex multi¬ 
plications and 1,023 transfers. 
The speedup, assuming that 
y , is com- 

M 2 \og 2 M 
M log 2 M + M -1 
For the case of M = 1,024, the 
speedup is approximately 
931. 

Using the multistage cube 
network, all PEs can perform 
each “+ i mod TV” network 
transfer required for the 
transpose, for a fixed i, l<i< 
TV, in a single pass. Neither 
the mesh nor the hypercube 
topology can perform each 
of these transfers in a single 
pass (intermediate nodes 
must be traversed). 

Each ID FFT computed in 
each PE can be executed in 
either SIMD mode or MIMD mode. In 
SIMD mode, there is potential for a 
great deal of CU/PE overlap, as there 
are three nested loops in most FFT cod¬ 
ings. However, because of the floating- 
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point computations involved, the time 
to execute many of the instructions 
may be data dependent, invoking the 
“max of the sums” versus “sum of the 
max’s” advantage for MIMD. In addi¬ 
tion, some highly efficient FFT imple¬ 
mentations contain several condition¬ 
al statements that are used to detect 
special cases where a simplified ap¬ 
proach can be employed, and condi¬ 
tionals are performed more effectively 
in MIMD mode. The final choice of 
mode depends on details of the actual 
FFT implementation and machine ar¬ 
chitecture used. The transpose can be 
done more efficiently in SIMD mode 
because of the CU/PE overlap and the 
reduced inter-PE transfer overhead ad¬ 
vantages. Thus, either a mixed-mode 
version, with the ID FFTs done in 
MIMD mode and the transpose done 
in SIMD mode, or a pure SIMD-mode 
version should be employed, depend¬ 
ing on specifics of the algorithm and 
machine. 

Bitonic sequence sorting. Consider 
the bitonic sorting of sequences on the 
PASM prototype. 2 Assume there are 
M numbers and N = 2" PEs, where M is 
an integer multiple of N , and that MIN 
numbers are stored in each PE — ini¬ 
tially sorted. The goal is to have each 
PE contain a sorted list of MIN ele¬ 
ments, where the elements in PE i are 
less than or equal to the elements in PE 
k, for i < k. The regular bitonic sorting 
algorithm, 10 where M = N, is modified 
in Figure 5 to accommodate the MIN 
sequences in each PE. 2 Instead of per¬ 
forming a comparison at each step, an 
ordered merge is done between the 
local PE sequence X and the trans¬ 
ferred sequence Y using local data- 
conditional statements (“merge(A', 
Y)”). The lesser half of the merged 
sequence is assigned the pointer X and 
the greater half is assigned the pointer 
Y. The pointers to the two lists are then 
swapped, based on a precomputed data- 
independent mask (“swap(X, Y)”). 

When choosing the mode of paral¬ 
lelism, the programmer must consider 
two salient characteristics of the algo¬ 
rithm. First, the ordered merge involves 
many comparisons that can be more 
efficiently computed in MIMD mode. 
Second, the algorithm requires many 
network transfers, which are better 
performed in SIMD mode. To evalu¬ 
ate different approaches to this algo¬ 
rithm, a pure SIMD, a pure MIMD, 


and two mixed-mode implementations 
have been executed on the prototype. 

In the S/MIMD (SIMD/MIMD) 
mixed-mode implementation, the or¬ 
dered merge and swap routines were 
executed in MIMD mode, while the rest 
of the operations, including network 
transfers, were performed in SIMD 
mode. This algorithm has an advantage 
over pure SIMD and pure MIMD im¬ 
plementations because all comparisons 
are done in MIMD mode and all net¬ 
work transfers are done in SIMD mode. 
Additionally, there is potential for sig¬ 
nificant CU/PE overlap in the SIMD 
instructions. 

The BMIMD (barrier MIMD) mixed¬ 
mode implementation uses MIMD mode 
but uses barrier synchronization 11 to 
synchronize all inter-PE transfers. This 
is typically performed in three steps. 
First, each PE arrives at a synchroniza¬ 
tion point in an algorithm called the 
barrier. Next, each PE will wait at the 
barrier until all the PEs have “an¬ 
nounced” that they are at the barrier. 
On PASM, this is accomplished by fetch¬ 
ing a word from the SIMD address space, 
thus using the SIMD instruction fetch 
synchronization hardware to implement 
the barrier. Finally, all PEs continue 
execution simultaneously. During the 
bitonic sorting algorithm, the PEs bar¬ 
rier synchronize before each inter-PE 
transfer. Consequently, the PEs can 
perform the transfer without the over¬ 
head normally involved with MIMD 
network transfers. Thus, the BMIMD 
implementation has the advantage of 
performing data-dependent condition¬ 
als in MIMD mode but performs barrier 
synchronization to reduce inter-PE data 
transfer overhead. Therefore, its per¬ 
formance would be expected to be bet¬ 
ter than pure SIMD or pure MIMD. 

Figure 6 shows the results of the SIMD, 
MIMD, S/MIMD, and BMIMD algo¬ 
rithms for the bitonic sorting problem 
with N = 16 PEs. There is a significant 
improvement in execution time for both 
mixed-mode algorithms. S/MIMD per¬ 
formed better than BMIMD, with the 
difference increasing with M, mainly 
because of the CU/PE overlap. The 
mixed-mode results are the product of 
properties inherent to the modes of par¬ 
allelism and not artifacts of the proto¬ 
type construction, as discussed by 
Fineberg et al. 2 The PASM prototype is 
a constantly evolving tool for under¬ 
standing the programming and design 
of parallel-processing systems. 


Mapping algorithms 
onto partitionable 
systems 

Two potential advantages of a parti¬ 
tionable parallel-processing system are 
demonstrated — the first involving sub¬ 
task parallelism and the second consid¬ 
ering the number of PEs assigned to a 
task. 

Impact of subtask parallelism. The 

effect of partitioning a parallel task into 
smaller, concurrent subtasks can have 
an impact on performance. 12 Consider 
the goal of smoothing four images such 
that the total time to smooth all four is 
minimized. 1 Two possible ways of per¬ 
forming this computation are to smooth 
the four images sequentially on all N 
PEs or to partition the task such that all 
four images are smoothed concurrent¬ 
ly, each using Nl 4 PEs. 

The time to smooth the four images 
in sequence is four times that to smooth 
a single image. Let one time step be the 
time required to perform a smoothing 
operation. Assuming that each inter- 
PE data transfer requires one step, the 
total time is 4 x (AfV/V + 4 (MNN) + 4) 
steps. For M = 512 and N = 1,024, this is 
1,296 steps. 

The total time for N PEs to smooth 
the four images concurrentl y, eac h on 
NI4 PEs, is M 2 /(NI4 ) + 4 (M / V/V/4)+ 4. 
For M = 512 and N = 1,024, this is 1,156 
steps. Thus, partitioning the system and 
exploiting subtask parallelism decreas¬ 
es execution time. 

The reason for the reduced execution 
time by partitioning is that fewer inter- 
PE transfers are needed, 4 x 
(M /Vn 14) + 4 versus 4 x (4(mNn) + 
4). For the example with M = 512 and N 
= 1,024, this is 132 versus 272 inter-PE 
transfers. 

The efficiency E of a parallel imple¬ 
mentation, which measures the amount 
of incurred overhead, is 

_ _ speedup _ serial time 

# PEs (# PEs) x parallel time 

4 x (M - if 
N x parallel time 

For N = 1,024 and M = 512, the efficien¬ 
cy of smoothing four M x M images in 
sequence is 78 percent, while the effi¬ 
ciency of smoothing all four images si¬ 
multaneously on a system partitioned 
into four submachines of size A/4 PEs 
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2 7 

2 8 

2 9 

glO 

2 11 

2 12 

2 13 

2 14 

Time units 

198 

144 

122 

116 

118 

124 

132 

141 


Figure 7. Execution time versus number of PEs ( N) for 
M = 2 14 , x = 10. 


each is 88 percent. The effi¬ 
ciency improved because the 
larger subimage size (32 by 
32 versus 16 by 16) reduces 
the percentage of the total 
execution time spent doing 
inter-PE data transfers (132/ 

(32 2 + 132) = 11 percent ver¬ 
sus 68/(16 2 + 68) = 21 per¬ 
cent). 

This example illustrates 
how partitioning for subtask parallel¬ 
ism can be used to improve performance. 
In this case, both execution time and 
efficiency are improved. 

Impact of increasing the number of 
processors. Consider the impact of in¬ 
creasing N on the performance of both 
the image-smoothing algorithm and the 
recursive-doubling algorithm. 12 Recall 
that smoothing an M x M image re¬ 
quires each PE to perform A PIN smooth¬ 
ing operations and 4{MNN) + 4 inter- 
PE data transfers. The execution time 
decreases as N increases. The denomi¬ 
nator of the expression for the efficien¬ 
cy of the parallel smoothing algorithm, 
N x (MVN + 4(MNN) + 4) = M 2 + 4 AfV/V 
+ 4 jV, increases with N, so as N increas¬ 
es, the efficiency decreases. Thus, in¬ 
creasing N improves the total execution 
time but causes the efficiency to de¬ 
crease. This inverse relationship is a 
marked contrast from serial-algorithm 
performance. 

The impact of increasing N has a dif¬ 
ferent effect on the recursive-doubling 
algorithm than on the image-smoothing 
algorithm. Assuming MIN numbers are 
stored in each PE, where N = 2", the 
execution time, T total , is 

7) oa d + ((M/AO -1) 

+ (log 2 n) r transfer . add 

In this case, as N increases, the execu¬ 
tion time first decreases and then in¬ 
creases. Assume a transfer-add takes x 
times as long as an addition; that is, 

transfer-add = tx T add , and T load = T add . The 

denominator of the efficiency for this 
algorithm is M + x/Vlog 2 A, so the effi¬ 
ciency always decreases as N increases. 

The partial derivative with respect to 
N of the execution time yields 


the execution time decreases, and as N 
increases beyond {Mix) ln2, the execu¬ 
tion time continually increases. The 
crossover point is the value of N = 2" 
such that I N-(Mix) In21 is minimal. For 
example, if x = 10 and M = 2 14 , then the 
execution time is (2 H IN + 10 log 2 A0 x 
T add time units (see Figure 7). The cross¬ 
over point is calculated as N = 2" such 
that I N- (2 14 /10) ln2 I is minimal. Thus, 
N = 2 10 . Given a system with 2 14 PEs and 
the assumptions made, the recursive¬ 
doubling-algorithm execution time can 
be minimized by performing it on a 
partition of size 2 10 PEs, and the other 

2 « _ 210 p Es can b e usec j f or othe,- j 0 b S- 

Thus, increasing N may or may not 
improve an algorithm’s overall execu¬ 
tion time. Furthermore, for parallel al¬ 
gorithms improved efficiency may not 
imply improved execution speed, and 
vice versa. 


T his article introduced some of 
the issues pertinent to the map¬ 
ping of computer-vision-related 
algorithms onto a class of large-scale 
reconfigurable parallel-processing sys¬ 
tems. Currently available commercial 
massively parallel systems have their 
roots in academic research from the 
past two decades. Aspects of reconfig¬ 
urable systems that are now being ex¬ 
plored in both academia and industry 
will become part of the next generation 
of massively parallel machines. 

Three of the dimensions of parallel¬ 
ism were examined here through a set 
of case studies. These aspects of paral¬ 
lelism should be considered in the de¬ 
sign and selection of future large-scale 
parallel-processing systems for comput¬ 
er-vision applications. ■ 
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PROJECT OVERVIEWS 


The three overviews that follow are short reports of 
ongoing research in image understanding architecture, 
SIMD parallelism in computer vision, and software 
environments for parallel computer vision. 


Image Understanding Architecture: 

Exploiting Potential Parallelism in Machine Vision 

Charles C. Weems, Edward M. Riseman, and Allen R. Hanson 
University of Massachusetts, Amhurst, MA 01003 


P arallel processing is now gen¬ 
erally accepted as necessary to 
support real-time image under¬ 
standing applications. Much debate re¬ 
mains, however, about what form of 
parallelism to employ. Part of this de¬ 
bate stems from the tremendous amount 
and variety of potential parallelism in 
machine vision. 

The sensory data alone is a good ex¬ 
ample: a medium-resolution image (512 
x 512 pixels) consists of roughly a quar¬ 
ter of a million data values. In many 
cases, each of these values might be 
processed in parallel. Further, if images 
are obtained from a video camera, the 
steady stream of data lends itself to 
pipelined parallelism. Some data in¬ 
volves multiple sensors (for example, 
stereo or nonvisual spectral bands), thus 
providing yet another potential source 
of parallelism. Nor is it unusual to ex¬ 
tract many different features from a 
given image or set of images (for exam¬ 
ple, lines, regions, texture patches, depth 
maps, and motion parameters), and these 
processes may also be carried out in 
parallel. 

Beyond the sensory data, image un¬ 
derstanding involves knowledge-based 
processing; and between these two lev¬ 
els of abstraction, symbolic processing 
has proved useful. Thus, vision research¬ 
ers tend to classify algorithms and rep¬ 
resentations into three levels: low (sen¬ 
sory), intermediate (symbolic), and high 
(knowledge-based). 

Of course, the existence of multiple 


levels of abstraction is yet another source 
of potential parallelism. Moreover, pro¬ 
cessing within each level presents many 
possibilities for exploiting parallelism. 
Part of the allure of developing a vision 
machine, from a computer architect’s 
perspective, is this tremendous quanti¬ 
ty, diversity, and complexity of latent 
parallelism. By comparison, most scien¬ 
tific and engineering applications have 
simple organizations with straightfor¬ 
ward requirements for parallelism. (For 
more detailed analysis of the potential 
for parallelism in image understanding, 
see Weems. 1 ) 

Image Understanding Architecture. 

Over the past five years, the University 
of Massachusetts and Hughes Research 
Laboratories have worked together to 
develop a hardware architecture that 
addresses at least part of the potential 
parallelism in each of the three levels of 
vision abstraction. A l/64th-scale proof- 
of-concept prototype of this machine 
has been built and is shown in Figure 1. 
The machine, called the Image Under¬ 
standing Architecture (IUA), consists 
of three different, tightly coupled paral¬ 
lel processors: the content addressable 
array parallel processor (CAAPP) at 
the low level, the intermediate commu¬ 
nication associative processor (ICAP) 
at the intermediate level, and the sym¬ 
bolic processing array (SPA) at the high 
level. Figure 2 shows an overview of the 
architecture. The CAAPP and ICAP 
levels are controlled by an array control 


unit (ACU) that takes its directions from 
the SPA level. 

The SPA is a multiple-instruction 
multiple-data (MIMD) parallel proces¬ 
sor, while the intermediate and low lev¬ 
els operate in multiple modes. The 
CAAPP operates in single-instruction 
multiple-data (SIMD) associative or 
multiassociative mode, and the ICAP 
operates in single-program multiple-data 
(SPMD) or MIMD mode. In multiasso¬ 
ciative mode, CAAPP cells execute the 
same instruction stream but in disjoint 
groups, with each group capable of op¬ 
erating on locally broadcast values and 



Figure 1. First-generation prototype of 
the Image Understanding Architecture. 
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Figure 2. Overview of first-generation Image Understanding Architecture. 


locally computing its own summary val¬ 
ues in parallel with all other groups. In 
SPMD mode, the ICAP processors exe¬ 
cute the same program but have their 
own instruction pointers so that they 
can branch independently. 

How does the IUA address the vari¬ 
ous forms of potential parallelism de¬ 
scribed in our introduction? We will 
answer this question by considering the 
capabilities of each level in Figure 2. 
The I/O staging memory permits one or 
more sensors to input images into a 
buffer that can hold up to 15 seconds of 
imagery at 30 frames per second and a 
resolution equal to the size of the low- 
level processor array. The resolution of 
the images can differ from the array 
size, with a resulting increase or de¬ 
crease in the number of frames that can 
be buffered. 

The CAAPP consists of bit-serial pro¬ 
cessors, each with an arithmetic logic 
unit, registers, 320 bits of explicitly man¬ 
aged on-chip cache memory, and 32 Kbits 
of backing store (main) memory. Be¬ 
cause it is a SIMD processor, its instruc¬ 


tions are broadcast from the ACU. 
However, each processor also contains 
a one-bit register that controls whether 
it will respond to a particular instruc¬ 
tion. 

The processors are connected via a 
reconfigurable mesh, called the coterie 
network. Each processor controls four 
switches that configure the mesh con¬ 
nections to its four nearest neighbors 
(north, south, east, west) and four switch¬ 
es that permit signals to bypass the pro¬ 
cessor (northeast, northwest, horizon¬ 
tal, and vertical). When the switches are 
set, connected processors form a cote¬ 
rie. The mesh may simultaneously con¬ 
tain many nonoverlapping coteries. 

Within a coterie, one processor may 
be selected to broadcast a value to the 
members of the coterie, or any subset of 
the processors may send a value bit- 
serially over the network. In the latter 
case, the processors receive the logical 
OR of the bits that were transmitted — 
that is, if some of the processors trans¬ 
mit a 1, then all processors receive a 1; 
but if none of the processors transmits a 


1, then all processors receive 0. This 
some/none test is a valuable summary 
mechanism that can be used in many 
ways. For example, it can be used to 
determine the maximum of a set of val¬ 
ues contained in a coterie. 

If the array has been split into cote¬ 
ries corresponding to regions in an im¬ 
age, then we can use the maximum- 
value operation to label connected 
components. Each processor is merely 
given a unique value (its address) and 
then the maximum-value operation de¬ 
termines the maximum address within 
each coterie. The value is then broad¬ 
cast to the members of the coterie as 
their component label. Note that all of 
this takes place in every coterie simulta¬ 
neously, even though there is only a 
single instruction stream. In the CAAPP, 
connected-components labeling thus 
takes only about 50 microseconds. Many 
other operations on image regions and 
edges can be performed quickly when 
the network is arranged to match their 
shape. The ability to simultaneously 
perform queries and summarize results 
in independent groups of processors 
under a single instruction stream result¬ 
ed in the term multiassociative for this 
mode of parallelism. 

The main memory for the CAAPP is 
also directly accessible to the ICAP 
through a second port. Each ICAP pro¬ 
cessor has access to the 8 x 8 tile of 
CAAPP processors below it, providing 
a highly parallel data path between the 
two levels. Each ICAP processor is a 16- 
bit digital signal processor (DSP) with 
128 Kbytes of program memory and 128 
Kbytes of data memory. We selected a 
DSP because it provides a set of opera¬ 
tions (such as single-cycle square and 
add) that are well suited to computa¬ 
tions in spatial geometry. The DSP is 
also designed for use with a minimum 
amount of external logic, and it pro¬ 
vides a set of communication channels 
that are used for interprocessor com¬ 
munication. As an example of its capa¬ 
bilities, the intermediate level can si¬ 
multaneously match several thousand 
models against symbolic descriptions of 
events (tokens) extracted from an im¬ 
age by the CAAPP. 

The ICAP connects to another dual- 
ported memory, which it shares with the 


66 


COMPUTER 





































// Created by James H. Burrill, University of Massachusetts 

# include “stream.h” 

# include “IuaClassLib.h” 

// Segment ‘intensity_image’ into regions by comparing the values of 
// neighboring pixels. Return the pattern for the virtual Coterie switches. 

CharPlane run_conn_comp(CharPlane &intensity_image) 

{Everywhere active; // Ensure that every pixel participates 
BitPlane temp; 

CharPlane save_connections; 

temp = (intensity_image == intensity _image.West()) & 
~temp.WestEdge_p(); 
save_connections.InsertBits(temp, WL); 

temp = (intensity_image == intensity_image.North()) & 
~temp.NorthEdge_p(); 
save_connections.InsertBits(temp, NL); 

temp = (intensity_image == intensity_image.East()) & 
~temp.EastEdge_p(); 
save_connections.InsertBits(temp, EL); 

temp = (intensity_image == intensity_image.South()) & 
~temp.SouthEdge_p(); 
save_connections.InsertBits(temp, SL); 

return save_connections; 

) 


Figure 3. Example C++ program using the image-plane class library. 


| SPA. Each SPA processor can access 
I data stored in this memory by any ICAP 

I processor. Our current plans are to use 
a commercially available multiproces- 
I sor at this level to provide general-pur- 
I pose computational capabilities for high- 
[ level processing. The SPA also has its 
I own shared memory. The ACU, which 
I manages the CAAPP and ICAP, is con- 
I nected to that memory and communi- 
t cates with the SPA processors as if it 
I were just another processor of the same 

| type. The full-scale IUA can thus pro- 
I cess in parallel all pixels of a single 512 
I x 512 image, several thousand tokens, 
I and up to 64 high-level processes. Sim- 
I ulations of the full-scale IUA have shown 

I that it can support model-based recog- 
I nition tasks at or near frame rate, which 
I is considerably closer to real-time im- 
I age understanding than previous sys- 
I terns. Nonetheless, even greater paral- 
I lelism will be required to achieve true 
I machine perception. (For more infor¬ 
mation on the first-generation IUA, see 
I Weems et al. 2 ) 

Second-generation IUA. A second 
generation of the IUA, currently under 
development, reflects experience from 
the prototype construction, advances in 
machine vision research, and newer 
hardware technology. It retains the over- 
| all three-level structure of the first gen- 
( eration, but the CAAPP and ICAP 
[ levels have been significantly en- 
I hanced. The new hardware implemen- 
I tation will encompass l/16th (16,384 
CAAPP, 64 ICAP, and 4 SPA proces- 
I sors) of a full-scale second-generation 

I system. The second-generation hard- 

I ware will be half the physical size of the 

I prototype IUA, yet will provide rough- 
I ly 10 times the processing power of that 
system. 

In the CAAPP, 256 processors now 
[ reside in a single chip, and each of these 

16 x 16 processor arrays is associated 
with an ICAP processor. Rather than 
treat the I/O staging memory as an I/O 
device, the new CAAPP treats it as 
merely another bank of main memory. 
Greater flexibility has been added to 
the interface with the ICAP as well. 

The ICAP processors now consist of 
32-bit floating-point DSP chips, each of 
which is capable of 50 Mflops. In addi¬ 


tion to the main memory of the CAAPP, 
each ICAP processor will have access to 
1 Mbyte of local memory and 4 Mbytes 
of shared memory within a local cluster 
of four processors. Whereas the first- 
generation prototype connected the 
ICAP processors via a centrally con¬ 
trolled bit-serial crossbar, the 64 proces¬ 
sors in the second generation will be 
fully connected by high-speed direct- 
memory-access channels. The array will 
also support a global shared memory, 
composed of all the local shared memo¬ 
ries, with a hierarchical access mecha¬ 
nism. 

Unlike the minimal ACU in the pro¬ 
totype system, the second generation 
will have a sophisticated controller, de¬ 
signed to support high-level languages 
and virtual processor arrays in the 
CAAPP. We have programmed the pro¬ 
totype CAAPP in Forth and C, using 
high-level syntax extensions to those 
languages that still require the program¬ 
mer to have considerable knowledge of 
the machine’s organization. These lan¬ 
guage extensions are really a halfway 


step between assembly language and 
high-level languages. 

In contrast, the second-generation 
CAAPP will be programmed in stan¬ 
dard C++, using a class library that de¬ 
fines image-plane data types. Programs 
written with the class library can be 
compiled and executed on any machine 
with a standard C++ compiler. To 
execute such programs on the IUA 
merely requires the use of a separate 
runtime library. Figure 3 shows a sam¬ 
ple C++ program for the CAAPP. 

The second-generation ICAP will be 
programmed in C with libraries to sup¬ 
port interprocessor communication. An 
Ada compiler will also be available. A 
symbolic database system to support 
processing, grouping, and matching of 
extracted image events and model parts 
is currently under development for the 
ICAP. 

The SPA will be programmed in yet 
another dialect of C, and a parallel Com¬ 
mon Lisp compiler will be available as 
well. A blackboard system will be avail¬ 
able to support knowledge-based pro- 
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cessing at the high level. One of our 
long-term goals is to develop a single, 
unifying model and language for pro¬ 
gramming the IUA so that program¬ 
mers will not have to distinguish explic¬ 
itly among the three levels. 

The future. Elements of a third-gen¬ 
eration IUA are already under study. 
We expect it to be a transitional step 
between the current three-level organi¬ 
zation with a single low-level array and 
future generations that will incorporate 
multiple, heterogeneous, low-level pro¬ 
cessors called virtual sensors. It may 
also be possible to split the hardware 
into more than three levels and thereby 
represent finer divisions of the abstrac¬ 
tion space in more complex vision ap¬ 
plications. 

Knowledge-based machine vision is 
both complex and computationally in¬ 
tense. It also presents a unique set of 
opportunities for exploiting parallelism. 
The Image Understanding Architecture 
has been built to capitalize on several of 
those sources of potential parallelism. 
Because the capacity for complex par¬ 


allelism in vision is far beyond the capa¬ 
bilities of current technology, parallel 
architectures for vision will continue to 
evolve at the forefront of innovation in 
architectural research.■ 
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D uring the past three decades, 
powerful new methods for im¬ 
age analysis have emerged, 
along with novel architectural concepts 
for the design and construction of mas¬ 
sively parallel machines. These devel¬ 
opments are often motivated by the need 
to process images at high speeds. How¬ 
ever, with some notable exceptions, re¬ 
search on architectures for image un¬ 
derstanding has been driven more by 
classical models of image processing 
(essentially, image-to-image transfor¬ 
mations and global feature extraction) 
than by the more powerful image repre¬ 
sentations and processing methods de¬ 


veloped by the image understanding 
community. 

In this article we consider two exam¬ 
ples from image understanding — fo- 
cus-of-attention vision and contour im¬ 
age analysis — and present new 
parallel-processing methods that effec¬ 
tively support these types of computa¬ 
tions. Our research is a blend of theory 
and practice. On the one hand, we aim 
to develop algorithms whose properties 
are well understood and can be formal¬ 
ly related to key aspects of machine 
models. On the other, we want algo¬ 
rithms that are easy to implement and 
practical in terms of their actual pro¬ 


cessing times on existing parallel ma¬ 
chines. Our experimental research was 
conducted on a 16,384-processor Con¬ 
nection Machine CM2, and we present 
results of algorithm implementations 
on that machine. 

In focus-of-attention vision, we use 
expectations about image structure to 
limit the image’s processing to regions 
expected to contain key image features. 
Focus-of-attention vision is a powerful 
control strategy for image understand¬ 
ing because it lets us limit processing to 
relatively small subsets of an image (es¬ 
pecially critical for sequential imple¬ 
mentations of image understanding sys- 
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terns). Perhaps more importantly, it lets 
us use specialized feature-extraction 
algorithms that are highly tuned by our 
expectations. 

How can we use massively parallel 
computers to build vision systems based 
on focus-of-attention methods? Mas¬ 
sively parallel computers have tens of 
thousands of processors, and focus-of- 
attention vision systems analyze rela¬ 
tively small image windows (typically 
containing only thousands of pixels). 
The classical approach for processing 
images on massively parallel machines 
— assigning each pixel to a processor— 
will leave most of the machine idle. It 
would be preferable to use as many of 
the processors as possible and have the 
time needed to process an image win¬ 
dow be a function of the number of 
pixels in that window. 

We have been studying the use of 
data replication techniques to achieve 
the goal of efficient focus-of-attention 
vision on massively parallel machines. 1 
We replicate the window to be pro¬ 
cessed many times and decompose ba¬ 
sic image analysis operations into 
components that can be computed si¬ 
multaneously on a SIMD machine. We 
briefly describe this research in the next 
section, “Replicated-data algorithms.” 

Contours (extended edges) are im¬ 
portant image structures for both match¬ 
ing and recognition. Many computation¬ 
al stereo models and motion analysis 
models are based on an analysis of the 
geometry of image contours. Addition¬ 
ally, most object recognition systems 
operate by initially reducing the inten¬ 
sity image to a set of contours and then 
matching their geometric properties 
against stored models. 

Contours are marked in images by 
processes such as edge detection and 
thresholding. Although it is possible to 
operate on the contours while they are 
embedded in the two-dimensional im¬ 
age, several reasons make it desirable, 
especially on richly interconnected ma¬ 
chines like the Connection Machine, to 
transform the image contours from their 
image-plane embedding to a linear repre¬ 
sentation. The two primary reasons are 

(1) The original image will ordinarily 
have far more pixels than the number of 




processors in the massively parallel com¬ 
puter. However, the number of pixels 
marked as edges by an edge detector is 
ordinarily only a small percentage of 
the pixels in the image. If we can re¬ 
move the contour pixels from the image 
plane and store them in linear data struc¬ 
tures, then we obviate the need to pro¬ 
cess the “empty” space in the edge im¬ 
age and reduce the “virtual processor 
ratio” (the ratio of the number of logi¬ 
cal processors needed to perform an 
operation to the number of available 
physical processors) to 1. 

(2) Richly connected machines like 
the hypercube-connected Connection 
Machine efficiently support a set of very 
powerful operations called parallel pre¬ 
fix, or scan, operators. These let us com¬ 
pute properties of processor subsets in 
time proportional to the logarithm of 
the subset size. Because of the arbitrary 
pattern of processor addresses encoun¬ 
tered when traversing an image con¬ 
tour, prefix operations cannot be effec¬ 
tively applied to the contours while they 
are embedded in the image plane. Trans¬ 
forming the image contours to a linear 
representation allows us to use these 
prefix operations. 

In summary, processing contours in 
their image-plane embedding makes the 
processing time proportional to the size 
of the image, while operating on them 
in the compact, linearized representa¬ 
tion makes the time proportional to the 
logarithm of the longest contour in the 
image. This is a significant difference. 
The key is to perform the transforma¬ 
tion from the image to the linear repre¬ 
sentation efficiently. This transforma¬ 
tion involves, as a first important step, 
ranking the pixels in each contour. In a 
previous paper 2 we presented one sim¬ 
ple 0 (log N) algorithm for ranking im¬ 
age contours (containing A points) and 
discussed its implementation on the 
Connection Machine. In this article we 
sketch the algorithm and illustrate the 
advantages of linearizing contours by 
considering the problem of piecewise 
linear approximation of contours. 

Replicated-data algorithms. Focus-of- 
attention vision can be supported by an 
approach based on techniques of data 


replication (see Narayanan and Davis 
for more details 1 ). Our approach in¬ 
volves replicating image windows many 
times on the processor array and de¬ 
composing a computation into subtasks 
that are solved simultaneously using the 
copies. The partial results from the cop¬ 
ies are combined to generate the overall 
problem solution. The technique uses 
data parallelism within each copy of the 
data structure and operation parallel¬ 
ism across the copies. The justification 
for this approach is that the number of 
processors is becoming a less critical 
resource in data-parallel computing us¬ 
ing massively parallel SIMD processor 
arrays, and its importance will continue 
to decline. To speed up processing of 
the relatively small data structures that 
arise in focus-of-attention vision on such 
machines, we need to devise techniques 
using a greater number of processors 
than there are data elements in the data 
structure, and divide the task performed 
on each data element among multiple 
processors. 

Data-parallel algorithms depend on 
efficient embedding of the data struc¬ 
ture onto the topology of the processor 
array machine’s interconnection net¬ 
work. In replicated-data algorithms, 
embedding has two dimensions: (1) 
Embedding of the individual copies must 
map proximate data elements to adja¬ 
cent processing elements; (2) corre¬ 
sponding data elements in different cop¬ 
ies should have an efficient inter¬ 
connection pattern among themselves 
for efficient computation across the cop¬ 
ies. The mechanisms provided by the 
machine to distribute the data to the 
different copies and to combine the par¬ 
tial results from the copies are also crit¬ 
ical in the design of a replicated-data 
algorithm. 

We illustrate our technique using dig¬ 
ital image convolution as an example. 
Digital image convolution by a two- 
dimensional kernel of weights is used in 
a variety of operations in image pro¬ 
cessing — for example, in smoothing 
and edge detection. Digital image con¬ 
volution by a k x k kernel, for an odd 
number k, is defined as follows: Assume 
the indices of the kernel range from 
- \k / 2j to \k!2\. Then each pixel P(u, 
v) of the image is mapped to a con- 
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Table 1. Comparison of the convolution algorithms on the Connection Ma¬ 
chine. Timing of convolution is shown in seconds for random image and kernel. 


Image 

Size 

Kernel 

Size 

VP 

Ratio 

Replicated-Data 

Algorithm 

One-Copy 

Algorithm 

64x64 

3x3 

4 

0.014 

0.005 

64x64 

5x5 

8 

0.023 

0.024 

64x64 

7x7 

16 

0.050 

0.091 

64x64 

11 x 11 

32 

0.098 

0.367 

64x64 

15xH5 

64 

0.212 

1.363 

64x64 

21 x21 

128 

0.400 

5.355 


volved value C(u, v) as given in 

k/2 k/2 

C(n,v)= £ £ P(u +i,v + j)K(i,j) 

i=-kk J=-kl2 

where P is the image array and K the 
kernel array. 

The replicated-data algorithm to com¬ 
pute the convolution of an n x n image, 
where n = 2 m ~\ by a k x k kernel of 
weights is given below. The algorithm 
uses k 2 3 4 copies of the image. These cop¬ 
ies can be visualized as forming a two- 
dimensional square array whose indices 
range from - \k / 2 J to [* / 2 J . The ker¬ 
nel weights are distributed one per copy 
such that copy (i,j) gets the value K(i,j) 
for -k/2 < i < k/2 and -k/2 <j< k/2. 

The algorithm’s four steps are 

(1) Using a scan operation with copy 
operator, copy the image stored in copy 
0 to all copies. 

(2) Broadcast the kernel weights to 
the copies in k 2 steps such that all pro¬ 
cessing elements (PEs) of the copy (i,j) 
get the kernel weight K(i, j). 

(3) The PE allocated to pixel ( u , v) of 
copy (t, j) of the image performs the 
following computation: Obtain the pix¬ 
el value from PE (u + i,v+ j) within the 
same copy (assume that all pixels out¬ 
side the image have a gray level of 0) 
and store it in P as its pixel value. Mul¬ 
tiply this value by the kernel weight 
allocated to the copy K(i, j) and store 
this result in C. 

(4) Perform a scan operation with 
“add” as the operator on C across the 
copies of each pixel of the image and 
store the result in a designated copy, say 
copy (0,0). The scan result gives the 
convolved image for each pixel. 


The replicated-data convolution al¬ 
gorithm was implemented on a 16,384- 
processor Connection Machine CM2. 
Table 1 compares the algorithm’s per¬ 
formance with that of a single-copy data- 
parallel algorithm. The virtual proces¬ 
sor capability of the Connection 
Machine, in which each physical PE sim¬ 
ulates multiple PEs, was used in the 
implementation, since the replicated- 
data algorithm needs many more pro¬ 
cessors than the physical array can sup¬ 
port. The number of virtual PEs each 
physical PE simulates is called the vir¬ 
tual processor ratio, or VP ratio (given 
in the third column of the table). The 
fourth column gives the timing of the 
replicated-data algorithm. The timing 
for the single-copy algorithm is given in 
column five when the CM2 was config¬ 
ured to have the same VP ratio as the 
corresponding replicated-data algo¬ 
rithm, thus making the comparison fair. 
The front-end computation overhead 
was negligible in both cases. On the 
whole, the replicated-data algorithm 
achieves impressive speedup over the 
single-copy version for image convolu¬ 
tion operations. 

Replicated-data algorithms reduce 
the computation time by exploiting 
operation parallelism, but they incur 
overhead in distributing the data to the 
copies and in combining the partial re¬ 
sults from them. This overhead is due to 
communications and computations 
across the copies of the data structure 
and depends on the efficiency of those 
operations when multiple copies are 
mapped onto the interconnection net¬ 
work. This efficiency is affected by the 
topology of the interconnection network 
and the time to perform near-neighbor 


communications and arithmetic opera¬ 
tions. 

We analyzed the replicated-data con¬ 
volution algorithm on different com¬ 
mon interconnection networks and de¬ 
termined the results for binary 
hypercubes, the underlying architecture 
of the CM2. The replicated-data algo¬ 
rithm performs better than the single¬ 
copy algorithm on hypercube architec¬ 
tures. For computing the convolution 
over a k x k neighborhood of an n x n 
image using k 2 copies, the speedup is 
given by 


2 t n +t a t h 

2 log k - £ -+-£- 

t m +t a +tn t m +t a +t " 

where t" is the time for a near-neighbor I 
communication, r* is the time for a gen¬ 
eral hypercube communication, t a is the 
time for an addition on the machine, 
and t m the time for a multiplication. 
Figure 1 compares the speedup predict¬ 
ed by the above equation with the speed¬ 
up obtained in practice on the CM2. 

There are other low-level-vision op¬ 
erations that benefit from the technique 
of data replication. Previously, we pre¬ 
sented analysis and implementation re¬ 
sults of the replicated histogram algo¬ 
rithm. 1 Table lookup, Hough transform 
computations, and co-occurrence ma¬ 
trix computations are some of the other 
operations that can be speeded up using 
techniques similar to those used in the 
histogram algorithm. 

Image algebra is an architecture-in¬ 
dependent language that can describe a 
large class of image operations. Convo¬ 
lution belongs to the class of general¬ 
ized template operations defined in 
image algebra. We developed a method 
to automatically generate a replicated- 
data algorithm for any image operation 
that can be described in terms of a gen¬ 
eralized template operation. 1 We also 
developed replicated-data algorithms for 
rank-order filters, which are local non¬ 
linear image operations. Rank-order fil¬ 
ters are expensive to compute on SIMD 
machines, as they involve independent 
sorting of the neighborhood pixels within 
each processor. Sorting can be per¬ 
formed quickly if the neighborhood el¬ 
ements are distributed among many 
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processors using the sorting algorithms 
of the underlying interconnection net¬ 
work. For instance, k neighborhood pix¬ 
els can be sorted in a replicated scenar¬ 
io on the hypercube in 0(log 2 k) time, 
whereas independent sorting on each 
PE takes 0(k z ) time on SIMD architec¬ 
tures. By assigning different areas of 
the search space to different copies of 
the data structure, we are extending the 
replication technique to problems in 
intermediate- and high-level vision that 
contain a combinatorial component. 

Image contour analysis. We turn now 
to problems associated with the effi¬ 
cient processing of image contours us¬ 
ing hypercube-connected massively 
parallel computers. An important ad¬ 
vantage of hypercube machines like the 
Connection Machine over the more com¬ 
mon mesh network is that one can effi¬ 
ciently compute parallel prefix opera¬ 
tions using the hypercube network. 
However, such operations can be ap¬ 
plied only to a sequence of processors in 
which each processor can randomly ac¬ 
cess information from processors dis¬ 
tance 2' away from it, where i is an 
integer greater than zero. Since a con¬ 
tour can wind freely through an image, 
the sequence of processor addresses 
associated with the pixels on a contour 
will not generally have this property. 
Below we present an efficient algorithm 
for ranking the pixels on a contour. 
Once the pixels are ranked, the contour 
can be moved to a new set of processors 
whose addresses will form a monotonic 
continuous sequence (by simply mov¬ 
ing the ith contour element to processor 
i ), thus allowing random access between 
processors. The algorithm we present 
runs in 0(log N) time (where N is the 
length of the contour) on either an ex¬ 
clusive read, exclusive write (EREW) 
parallel random access machine or a 
distributed-memory machine with 
EREW ability between memory mod¬ 
ules. (See the literature for more de¬ 
tails. 2 ) 

The problem is as follows: We are 
given an NXN binary image that con¬ 
tains only thin curves. Each pixel on 
each curve has exactly two neighboring 
pixels adjacent to it, with the exception 
of endpoints, which have only one neigh¬ 


bor. Closed curves (curves that form a 
loop and have no endpoints) can also 
occur in the image. Our goal is to list 
rank all the curves in the image in par¬ 
allel such that all open curves have one 
of their endpoints marked as the head 
and all other pixels on that curve deter¬ 
mine their distance (along the curve) 
from that endpoint. For closed curves, 
any arbitrary pixel can be chosen as the 
head, as long as all pixels on the curve 
agree on which pixel is the head and 
every pixel determines its distance from 
that head in a consistent direction (clock¬ 
wise or counterclockwise). 

The algorithm starts with two point¬ 
ers, PI and P2, pointing to neighboring 
pixels. The algorithm uses these point¬ 
ers to form an Euler tour path within 
each curve. Figure 2 illustrates this for a 
five-pixel open curve. If the Euler tour 
path is followed around an open curve, 
each pixel on the curve will be visited 
exactly twice, while the two endpoints 
are visited only once. Let’s call each 
element (pointer plus distance value to 
be computed) in the 
Euler tour path a node. 

The Euler tour is easily 
initialized. 

After the Euler tour 
path is initialized, each 
pointer in the path re¬ 
peatedly does pointer 
jumping (distance dou¬ 
bling) while remember¬ 
ing the maximum node 


address seen so far, as well as the dis¬ 
tance along the tour path to this maxi¬ 
mal node. In Figure 2, node 8 would be 
the node with the maximum address. 
Thus, after pointer jumping has termi¬ 
nated, all nodes in the Euler tour path 
will have identified node 8 as having the 
maximum address and will have com¬ 
puted the distance along the path to 
node 8. At this point, a list ranking for 
the open curve can be computed by 
simply having each pixel compute the 
minimum value of the distance to node 
8 for the two nodes associated with that 
pixel. 

Many details have been omitted here, 
such as how to initiate the Euler tour 
path, when to terminate the pointer¬ 
jumping loop, and the changes neces¬ 
sary to handle closed curves; a more 
detailed discussion is available. 2 

The list-ranking algorithm was im¬ 
plemented on the Connection Machine. 
Table 2 shows the results obtained by 
running our EREW 0(log N ) list-rank¬ 
ing algorithm on the CM2 for different 


Pixel: 1 

Pointer 1: ©-< 

Pointer 2: 

List ranking: 4 


-©-*—© ©- 


Figure 2. Example of Euler tour path on an open 
curve with five pixels. 
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Table 2. Result of CRCW and EREW Q (log N) time algorithm versus O(N) time algorithm (virtual processor ratio = 


Curve 

Length 


2,048 

4,096 


CRCW Algorithm 


Iterations Time (ms) Iterations 


EREW Algorithm 


Time (ms) 


726 

1,162 


O(N) Time Algorithm 

Iterations Time (ms) 

64 605 

128 1,219 

256 2,438 

512 4,859 

1,024 9,696 

2,048 19,410 

4,096 38,827 


longest curve lengths. An image can 
contain many curves, and the algorithm 
will rank them all simultaneously. How¬ 
ever, the time needed to rank the entire 
set of image curves is determined by the 
length of the longest curve in the image. 
As a comparison, the running times re¬ 
quired by the concurrent read, concur¬ 
rent write (CRCW) algorithm, present¬ 
ed elsewhere, 2 are also shown, as are 
times for a trivial linear-time algorithm 
that propagates the list-ranking infor¬ 
mation along each curve pixel by pixel. 
We can see that the EREW algorithm is 
slower than the CRCW algorithm, but 
both are much faster than the linear¬ 
time algorithm. The algorithms were 
applied to a 512 x 512 image using 8,192 
physical processors; thus, the VP ratio 
was 8 for all these experiments. 

Next, we use piecewise linear approx¬ 
imation of curves as an example to illus¬ 
trate how contours can be processed 
efficiently once they are linearized. Peu- 
cker devised a method for finding piece- 
wise linear approximations of curves by 
breaking curves at points that are far¬ 
thest from the line that connects the two 
endpoints of the curve. By repeatedly 
applying this curve-breaking method 
until all pixels of each curve are within 
a threshold distance from the line con¬ 
necting the endpoints, we can obtain a 


good piecewise linear approximation. 

This algorithm can be implemented 
easily on our monotone contiguous 
mapping between pixels and processors. 
The algorithm involves the following 
steps: 

(1) Perform a reverse first scan on 
the x and y coordinates, so that the first 
processor of each curve segment has the 
x, y coordinates of the two endpoints of 
the segment. 

(2) The first processor of each curve 
calculates the coefficients of the line 
that passes through its endpoints. 

(3) A forward first scan is performed 
to broadcast the coefficients of this line 
to all pixels in this curve. 

(4) All pixels calculate, in parallel, 
the distance between themselves and 
the line joining the endpoints. 

(5) A reverse max scan is performed 
on this distance concatenated with the 
processor ID number of each pixel. This 
results in the first processor in each 
segment’s knowing the processor ID 
number of the largest address processor 
having maximal distance from the line. 
Let m, be the address of the processor 
having maximal distance in curve seg¬ 
ment i. 

(6) If this maximum distance is small¬ 
er than a threshold, the segment de¬ 


selects itself and is idle through steps 7 
and 8. If all segments in the image de¬ 
select, the algorithm terminates. 

(7) A forward first scan is used to 
broadcast m t to all processors in curve 
segment i. 

(8) Processor m, sets its segment flag 
to “True,” thus splitting curve segment 
i for the next iteration. 

(9) Steps 1 through 8 are repeated 
until the algorithm finally terminates at 
step 6. 

We applied this piecewise linear ap¬ 
proximation algorithm to three differ¬ 
ent 512 x 512 test images using the scan 
instructions available on the CM2. Ta¬ 
ble 3 shows the results. Edge detection 
and the list-ranking algorithm were ap¬ 
plied to the three images, and the re¬ 
maining pixels were packed into mono¬ 
tone contiguous processors. At this point 
each image contained C start contours, 
and after applying the algorithm, each 
contained C end linear segments. We can 
see that typical images will require five 
to six iterations of the algorithm before 
termination. On the CM2, each itera¬ 
tion takes roughly 7 milliseconds. 

We are working to develop practical 
curve-matching algorithms, as well as 
stereo-matching algorithms. We have 
also worked on an efficient parallel al¬ 
gorithm for computing the visibility 
graph of a polygon by using only paral¬ 
lel prefix operations for communica¬ 
tion (closed curves can be transformed 
into polygons by the piecewise linear 
approximation algorithm). 

The research discussed here focuses 
on the effective use of massively paral¬ 
lel computation for representative prob¬ 
lems in intermediate-level vision. One 


Table 3. Result of applying the algorithm. 


Total 



Splitting Algorithm 


Processors 

Pixels 

c start 

C e „d 

Iterations 

Time (ms) 

8,192 

7,706 

581 

800 

5 

36 

8,192 

6,735 

515 

700 

5 

37 

8,192 

7,943 

630 

877 

6 

46 
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of the greatest challenges facing the 
image understanding community is to 
discover how to use parallelism to ad¬ 
dress problems in high-level vision — 
that is, image interpretation and scene 
analysis. While image understanding 
itself is the least developed aspect of the 
field, we can see several architectural 
solutions emerging in the current de¬ 
cade. These include the use of hetero¬ 
geneous but tightly coupled systems like 
the Image Understanding Architecture, 
which attempts to capture one of the 
basic image understanding paradigms 
of the 1980s, and the use of homoge¬ 
neous massively parallel systems, which 
use a single computational paradigm 
(for example, neural computing, con- 
nectionism, constrained combinatorial 
analysis) to address high-level-vision 
problems. These and alternative vision 


architectures deserve the attention of 
vision researchers in the 1990s.B 
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W e are developing a soft¬ 
ware environment tailored 
to computer vision and im¬ 
age processing (CVIP). Although ob¬ 
taining highest performance on parallel 
systems will almost certainly 
require sophisticated knowl¬ 
edge of parallel processing 
(for example, see Stout 1 ), it is 
both unrealistic and undesir¬ 
able to expect a researcher in 
the CVIP area to be an expert 
in parallel problem-solving 
techniques or parallel archi¬ 
tectures. It is essential to pro¬ 
vide tools that let applications 
researchers achieve reason¬ 
ably high performance at a 
reasonable level of program¬ 
ming effort. The software en¬ 
vironment focuses on how in¬ 
formation about the CVIP 


problem domain can make the high- 
performance algorithms and the sophis¬ 
ticated algorithm techniques being de¬ 
signed by algorithms experts more 
readily available to CVIP researchers. 


The software environment consists of 
three principal components — DISC, 
Cloner, and Graph Matcher—shown in 
Figure 1. At the heart of the environ¬ 
ment, and key to the operation of all 
three components, is a set of 
algorithm libraries, along with 
a metalevel of algorithm char¬ 
acteristics that abstract infor¬ 
mation about the library pro¬ 
grams. The environment also 
includes traditional compilers, 
debuggers, and operating sys¬ 
tems components. However, 
our goal is to exploit the spe¬ 
cial characteristics of CVIP to 
achieve easier algorithm de¬ 
velopment and better perfor¬ 
mance than can be expected 
with general-purpose tools; 
therefore, we focus on the sub¬ 
systems in Figure 1. Each com- 



Figure 1. Overview of the software environment for 
computer vision and image processing. 
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ponent addresses a different aspect of 
the problem of rapid prototyping for 
CVIP algorithms and tasks: 

• DISC (dynamic intelligent schedul¬ 
ing and control) supports experimenta¬ 
tion at the CVIP task level by creating a 
dynamic schedule from a user’s specifi¬ 
cation of the algorithms that constitute 
a complex task. 

• Cloner is aimed at the algorithm 
development process and is an interac¬ 
tive system that helps a user design new 
parallel algorithms by building on and 
modifying existing library algorithms. 

• Graph Matcher performs the criti¬ 
cal step of mapping new algorithms onto 
the target parallel architecture. 

We have completed initial implementa¬ 
tions of DISC 2 and Graph Matcher 3 ; 
work on Cloner is in progress. The re¬ 
mainder of this article summarizes the 
components of the CVIP software envi¬ 
ronment. 

DISC: A dynamic scheduler for exe¬ 
cuting computer vision tasks. The DISC 
system is designed to facilitate system 
prototyping, the experimental process 
during which a user tests strategies for 
performing a complex task by trying 
different component algorithms, differ¬ 
ent orderings of algorithms, and differ¬ 
ent strategies for controlling the selec¬ 
tion and sequencing of algorithms. DISC 
is implemented as an expert system that 
uses a library of low-, mid-, and high- 
level-vision algorithms and alternative 
parallel implementations, a database of 
execution characteristics of CVIP algo¬ 
rithms, rule-based heuristics, and the 
current system state to produce and con¬ 
tinually update a schedule for the sub¬ 
tasks (algorithms) that constitute the 
overall task. The scheduler keeps track 
of what subtasks are potentially execut¬ 
able and chooses the best candidate by 
considering the relative importance of 
finishing the subtask quickly and the 
extent to which the current allocation 
of data in the machine partitions (sub¬ 
sets of processing elements) matches 
the data allocation needed by the sub¬ 
task. DISC also controls repartitioning 
and compaction of the system. 

Figure 2 is a graph representing data 


dependencies among the algorithms con¬ 
stituting a sample task. The primitives 
of the DISC language are the library 
algorithms. The graph is derived from a 
sequential listing of the algorithms and 
their arguments. The data dependen¬ 
cies are derived from the input/output 
specifications for the parameters for each 
algorithm. Once a subtask is chosen for 
execution, the scheduler selects the most 
suitable implementation of that subtask 
from the library. Implementations may 
differ by 

• the way data is allocated to process¬ 
ing elements (for example, pixel data 
allocated by rows versus by square 
subimages, and contours allocated 
by object versus by coordinates); 

• the format of the input and output 


Algorithm 

characteristics 

The ability to characterize algo¬ 
rithms is key to the DISC and Clon¬ 
er systems. The characteristics 
used are derived from the following 
general set of parallel-algorithm 
characteristics: 

• Nature of parallelism: data or 
function. 

• Data granularity or module 
granularity: the size of the data 
items processed as a unit or the 
size of independent modules. 

• Degree of parallelism. 

• Uniformity of operations, ex¬ 
pressed as the smallest data granu¬ 
larity at which uniform operations 
are performed. 

• Synchronization requirements 
and precedence constraints. 

• Static/dynamic character of the 
algorithm, in terms of the pattern of 
process generation and termination. 

• Data dependencies and related 
issues of data allocation and mem¬ 
ory access patterns. 

• Five characteristics shared by 
serial and parallel algorithms: fun¬ 
damental operations, data types 
and precision, memory require¬ 
ments, data structures, and I/O. 


parameters (for example, binary im¬ 
age versus edge list); 

•mode (single instruction, multiple 
data, or SIMD; or multiple instruc¬ 
tion, multiple data, or MIMD); and 
• range of number of processors us¬ 
able by the implementation. 

DISC selects an implementation based 
on how well its characteristics coincide 
with the current data allocation, data 
format, and mode of the chosen system 
partition, and based on the expected 
relative speedup for the size of the par¬ 
tition. The scheduling is performed dy¬ 
namically to handle situations common 
in vision applications: algorithms for 
which the execution time depends on 
the input image (for example, boundary 
tracing) and tasks in which the actual 
sequence of algorithms executed may 
vary depending on characteristics of the 
image (for example, the Edge linking/ 
Edge continuity test loop in Figure 2). 

DISC has been implemented with the 
PASM (partitionable SIMD/MIMD) 
parallel-processing system as its target 
architecture. Evaluation to date has 
consisted of simulations of tasks cover¬ 
ing a spectrum of dependency graphs. 
In these tests, each library algorithm 
was represented by at least 12 imple¬ 
mentations — typically two or three 
different implementation strategies ex¬ 
ecutable on six different partition sizes. 
For algorithms with image-dependent 
execution times, the simulated times 
were randomized by the simulation con¬ 
troller, and at least 100 runs were per¬ 
formed. 

Performance was evaluated using 
measures including utilization and sched¬ 
uling overhead. Utilization is measured 
as the percentage of the processor-time 
space during which processors are not 
idle. Figure 3 shows a processor-time 
diagram for the task in Figure 2. Over 
the nontrivial tasks on which DISC has 
been tested, a 77-percent average utili¬ 
zation was achieved, and on the tasks in 
the test suite for which the optimal sched¬ 
ule was known, DISC obtained the op¬ 
timal schedule. 

Scheduling overhead is used to assess 
the amount of overhead DISC incurs in 
creating a schedule. The scheduling over¬ 
head is counted as the number of rules 
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Figure 2. Algorithm data-dependency graph for a CVIP task Figure 3. Example processor-time diagram for the task in 

to locate tanks in forward-looking infrared radar images. Figure 2. (Time axis is not to scale.) 


fired to begin the execution of an imple¬ 
mentation of each algorithm. Using re¬ 
alistic assumptions to relate simulation 
time units and real time, scheduling 
overhead was measured to be less 
than 0.1 percent of the overall execu¬ 
tion time. 

Current work on DISC includes ex¬ 
pansion of the library, refinement of the 
scheduling heuristics, performance anal¬ 
ysis based on stochastic modeling of 
image-dependent execution character¬ 
istics, and packaging as a tool portable 
to a variety of parallel architectures. 

Cloner: An interactive environment 
for developing parallel algorithms. Clon¬ 
er is a software reuse tool that helps a 
user design parallel algorithms by build¬ 
ing on and modifying existing library 
algorithms. It takes advantage of the 
fact that CVIP algorithms, especially 
for low-level vision, are often highly 
structured and that many image- and 
graph-based algorithms share the same, 
or a similar, structure. Cloner’s dual 
goals are rapid prototyping and im¬ 
proved code quality. 

The library forms the heart of Cloner. 
The user is provided with information 


about what algorithms and programs 
the system “knows about” and is guided 
through the process of developing a new 
program by relating it to existing pro¬ 
grams. This may be done by a combina¬ 
tion of operations, including the adop¬ 
tion and/or modification of library data 
abstractions (objects), the composition 
of library code segments, and the 
modification of library code templates. 
Emphasis is on providing a character¬ 
ization of the library algorithms and 
kernels to act as an interface between 
the library and the user. Cloner exploits 
reusable code at several levels: 

• the data-structure/data-abstraction 
level, 

• the control structure level, and 

• the algorithm level. 

Interaction is via menus and queries 
that lead the user through an adaptive 
series of displays and questions designed 
to let either the user or Cloner select 
code templates. The questions are based 
on characteristics of the algorithm in 
terms of attributes such as data versus 
function parallelism and dominant data 
structures. Graphical displays show data- 


dependency patterns associated with a 
library program or kernel (for example, 
an operation structured around a 3-by- 
3-pixel window) and highlight potential 
reusable code fragments (for example, 
an optimized looping structure displayed 
to delimit the control structure and the 
replaceable loop body). 

We are now building Xcloner, an X 
window-based implementation of Clon¬ 
er. Figure 4 on the next page shows an 
example of the main Xcloner menu for 
image-processing operations. The cate¬ 
gories in the main menu bar identify the 
major operations provided by Xcloner. 
Each category provides a pull-down 
menu. At the highest level, Xcloner is 
machine independent. Information spe¬ 
cific to the target parallel system resides 
within the menu options and in the cod¬ 
ing and mapping of the library algo¬ 
rithms. The initial implementation fo¬ 
cuses on image processing and 
low-level-vision algorithms because the 
obvious common structures for these 
algorithms facilitate the development 
of the user interface and library access 
tools. However, many algorithms per¬ 
formed in mid- and high-level vision use 
well-defined structures such as graphs 


February 1992 


75 



























































PROJECT OVERVIEWS 



Figure 4. Example of Cloner display for image-processing algorithms. 
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Figure 5. Dependency graph (a) and hypergraph (b) for a histogramming algo¬ 
rithm. N = number of pixels. Hypernode labels show the number of DG nodes 
combined to form the hypernode. Hyperedge labels characterize the pattern of 
the edges merged to form the hyperedge: D = Distribute (one to many); P = 
Parallel (one to one). 


to represent regions and objects. Fu¬ 
ture work includes expansion of Cloner 
to include these algorithms. 

Graph Matcher: Mapping CVIP al¬ 
gorithms onto parallel architectures. If 

a user has specified a new algorithm 


directly (for example, by a data-depen- 
dency graph or dataflow graph, or as a 
program that is transformed to such a 
graph), then the critical step in mapping 
the algorithm onto the parallel archi¬ 
tecture involves the assignment of algo¬ 
rithm steps to processors. The objective 


is to equalize the work done by the 
processors and to minimize overhead 
from interprocessor communication. 
General-purpose compilers and map¬ 
ping tools are designed to perform this 
function. However, in many algorithms 
with regular structure, it is possible to 
exploit prior experience to obtain map¬ 
pings that might be difficult or prohibi¬ 
tively time-consuming to derive using 
general-purpose tools. 

Graph Matcher uses pattern-match¬ 
ing techniques to recognize the data- 
dependency structure of a new algo¬ 
rithm as an instance of a dependency 
structure for which an algorithm-to-ar- 
chitecture mapping is already known. 
At the data-dependency level, especial¬ 
ly in low-level vision and image pro¬ 
cessing, many algorithms share commu¬ 
nications patterns: 

•patterns typical for window-based 
algorithms, 

• patterns typical for block-based al¬ 
gorithms, 

• patterns characteristic of transforms, 
and 

• patterns typical for collecting image 
statistics. 

At the process level, algorithms based 
on the same paradigm (for example, 
divide and conquer) may exhibit similar 
communications requirements. 

Graph Matcher consists of a library 
of known data-dependency structures 
and mappings of these structures onto 
architecture configurations. The input 
to Graph Matcher is a directed acyclic 
graph representing a new algorithm. The 
process of matching the input depen¬ 
dency graph (DG) to one of the library 
graphs can be formulated as a graph 
isomorphism problem; however, the 
complexity of graph isomorphism makes 
direct and exhaustive comparison to the 
library infeasible. Graph Matcher there¬ 
fore relies on heuristics that use easy- 
to-compute graph attributes and that 
take advantage of the regular structure 
of many image-processing algorithms. 

In a candidate elimination step, easi¬ 
ly measured features of the input DG 
are compared to features of the library 
DGs to eliminate library graphs from 
further consideration. Features include 
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Table 1. Comparison of dependency-graph sizes and hypergraph sizes. 




No. of 

No. of 

No. of 

No. of 

Algorithm 

Parameters 

Nodes 

Edges 

Hypernodes 

Hyperedges 

Histogram 

No. of pixels: N 

3+2 N 

31V +1 

5 

4 

Hough 

No. of pixels: N 

2N + P + 2 

NP+N+P +1 

5 

4 

transform 

No. of parameters: P 





Threshold 

No. of pixels: N 

3N + 1 

3 N 

4 

3 

Smoothing 

8-connected 

11 

10 

3 

2 

Erosion 
and dilation 

8-connected 

10 

9 

3 

2 


graph size, simple connectivity proper¬ 
ties, and properties of particular verti¬ 
ces. Isomorphism testing against the 
remaining library graphs uses an ap¬ 
proach based on hypergraphs. Attrib¬ 
uted hypergraphs have been used for 
three-dimensional object recognition. 
For the mapping problem, they are used 
to reduce the size of regular DGs by 
letting a single hypergraph node (a hy¬ 
pernode) represent a set of nodes and 
letting hyperedges represent connec¬ 
tivity between hypernodes. 

Figure 5 shows the data-dependency 
graph and hypergraph for a histogram- 
ming algorithm. Table 1 compares the 
DG sizes and hypergraph sizes for a 
number of algorithms. The use of hy¬ 
pergraphs does not alter the asymptotic 
time complexity of graph isomorphism. 
However, it can yield graphs of such 
reduced size that the test for isomor¬ 
phism becomes feasible. For graphs with 
little or no regular structure, the hyper¬ 
graph approach may yield no reduction 
in graph size, and therefore no savings 
in the isomorphism testing. However, 
for graphs with regular structure, sub¬ 
stantial savings are realized. 

We have proven that the heuristic 
procedure for generating hypergraphs 
from DGs preserves isomorphism. Un¬ 
der conservative assumptions, the algo¬ 
rithm has worst-case time complexity 
0(n 4 ), where n is the number of nodes in 
the original DG. Under realistic assump¬ 
tions the complexity is 0(n 2 ). 

Graph Matcher has been implement¬ 
ed and tested on the Purdue Image Pro¬ 


cessing Library. Areas of future work 
on Graph Matcher include incorpora¬ 
tion of techniques to calibrate “close¬ 
ness” of matches between library and 
input DGs, and the use of these mea¬ 
sures to apply Graph Matcher to high- 
level-vision algorithms where graph sim¬ 
ilarity is a more appropriate criterion 
than graph isomorphism. 

Conclusions. Anecdotal reports 
abound about researchers with scientif¬ 
ic and engineering problems who have 
tried to make use of parallel-processing 
systems, and who have been almost fa¬ 
tally frustrated in the attempt. Our ex¬ 
perience with parallel-algorithm design 
suggests that the regular structure of 
many CVIP algorithms can form the 
basis for an effective specialized paral¬ 
lel-programming environment. Our goal 
is to exploit the special characteristics 
of CVIP to achieve easier algorithm 
development and better performance 
than can be expected with general-pur¬ 
pose tools. DISC and Graph Matcher 
and the in-progress Cloner are “proof- 
of-concept” implementations demon¬ 
strating the effectiveness of this special¬ 
ized environment.■ 
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Compcon afternoon session to open with computing pioneers 


Winners of major IEEE Computer 
Society awards will share experiences 
from their pioneering work during ac¬ 
ceptance speeches at Compcon Spring 
92 in San Francisco. The society’s 
awards and historical perspectives ses¬ 
sion is scheduled for Wednesday, Feb¬ 
ruary 26,1992, from 1:00 to 2:00 p.m. 
Computer Society President Bruce D. 
Shriver will chair the session, assisted 
by Awards Chair Ralph J. Preiss. The 
following awards will be presented; 

1991 Richard E. Merwin Award — 
Martha Sloan, for service to the com¬ 
puter profession. Sloan, professor of 
electrical engineering at Michigan 
Technological University, was 1984-85 
Computer Society president and was 
recently elected 1992 IEEE president. 

1990 Computer Pioneer — Werner 
Buchholz, for computer architecture. 
Buchholz worked for IBM for 40 
years and contributed to the architec¬ 
ture of IBM’s earliest large-scale com¬ 
mercial computer systems, the IBM 
701 and 702, as well as the Stretch su¬ 
percomputer. His latest contribution 
was the vector facilities for the Sys¬ 
tem/370 and System/390. He was also 
the first editor of the Institute of Ra¬ 
dio Engineers’ Transactions of the 
Professional Group on Electronic 
Computers, the predecessor of the 
current IEEE Transactions on Com¬ 
puters. 

1991 Computer Pioneer — Bob O. 
Evans, for compatible computers. 
Evans is currently a partner and exec¬ 
utive vice president of Technology 
Strategies and Alliances in Menlo 
Park, California. He retired from IBM 
as vice president and chief engineer in 
1984. In the early 1960s he led the lab¬ 
oratory that had worldwide responsi¬ 
bility for preparing IBM for the Sys¬ 
tem/360 revolution — one system 
architecture from the smallest to the 
largest data processing system. 

1991 Computer Pioneer — Thomas 
E. Kurtz, for Basic. Kurtz, long associ¬ 
ated with Dartmouth College, is being 


recognized for his contributions to 
Basic, the language used at Dart¬ 
mouth to train students in computing 
and which took off with the personal 
computer to gain worldwide promi¬ 
nence as the language in which thou¬ 
sands of PC programs were written. 

1991 Computer Pioneer — Robert 
W. Floyd, for early compilers. Floyd, 
who still teaches at Stanford Universi¬ 
ty, implemented one of the earliest 
Algol 60 compilers, along with meth¬ 
ods for compiler optimization. He lat¬ 
er worked on mathematical methods 
for proving the correctness of pro¬ 
grams, and on many fast algorithms. 

1990 W. Wallace McDowell Award 
— Lawrence G. Roberts, for ARPA- 
net. While working on networking 
problems at Lincoln Laboratory, Rob¬ 
erts was asked by the US Defense De¬ 
partment’s Advanced Research 
Projects Agency to use his talents on 
its behalf. DARPA funded not only 
much of the work at Lincoln Labs, but 
also many university research 
projects, each requiring a computer to 


The IEEE Board of Directors has 
completed its annual election for 1991 
and conferred the title of IEEE fellow 
on 65 Computer Society members. A 
total of 522 nominations were made, 
from which 244 were elected. Com¬ 
puter Society members accounted for 
26.6 percent of the new fellows, up 
from 21.1 percent in 1991 and 22.6 
percent in 1990. The IEEE fellow des¬ 
ignation was effective as of January 1, 
1992. 

The title of fellow recognizes out¬ 
standing contributions of senior mem¬ 
bers in one or more areas. The alpha¬ 
betical list below identifies the new 
IEEE fellows who are Computer Soci¬ 
ety members and the citation each 


carry out the research. The desire to 
have computers and researchers at 
different universities collaborate in 
real time led to the design and appli¬ 
cation of packet switching — and 
ARPAnet. 

1991 IEEE Emanuel R. Piore 
Award — Harold S. Stone, for parallel 
processor interconnections. Stone is 
well known for having shown the 
“perfect shuffle” to be fundamental 
for parallel processor communica¬ 
tions. This, together with his invention 
of recursive doubling, opened the way 
for successful parallel processing us¬ 
ing many computing elements. 

“With the rapid growth of computer 
technology, it is easy to lose sight of 
the innovations that helped us reach 
our current level of technical sophisti¬ 
cation,” said Computer Society 
Awards Chair Ralph Preiss. “Our 
awards program not only recognizes 
significant achievements, but also 
gives other professionals a chance to 
meet and hear from the personalities 
whose work helped propel the careers 
of so many others.” 


new fellow received. The IEEE soci¬ 
ety, group, or council that evaluated 
the respective candidate appears in 
parentheses. 

Bryan D. Ackland, Holmdel, New Jersey, 
for contributions to the design of custom 
integrated circuits for signal processing 
systems (Circuits and Systems). 

Vinod K. Agarwal, Montreal, Canada, for 
contributions to built-in self-tests of digital 
systems and fault-tolerant computing 
(Computer). 

Syed V. Ahamed, Somerset, New Jersey, 
for contributions to the study and design of 
digital subscriber lines (Communications). 


IEEE elects 65 Computer Society members 
to fellow grade 
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Narendra Ahuja, Urbana, Illinois, for con¬ 
tributions to three-dimensional computer 
vision (Computer). 

Dennis J. Allan. Stafford, England, for 
contributions to the design and develop¬ 
ment of power transformers (Power Engi¬ 
neering). 

Charles C. Allen, Berwyn, Pennsylvania, 
for leadership in the development and ap¬ 
plication of antennas for radar and com¬ 
munication systems (Aerospace and Elec¬ 
tronic Systems). 

Akihiro Ametani, Kyoto, Japan, for contri¬ 
butions to the analysis of electrical tran¬ 
sients in power systems (Power Engineer¬ 
ing). 

Jean-Loup Baer, Seattle, Washington, for 
contributions to the design and analysis of 
parallel computer systems (Computer). 

Ruzena K. Bajcsy, Philadelphia, Pennsyl¬ 
vania, for contributions to machine percep¬ 
tion and robotics (Robotics and Automa- 

Paul H. Bardell Jr., Carmel, New York, for 
contributions to built-in self-tests of digital 
systems (Computer). 

James C. Bezdek, Pensacola, Florida, for 
contributions to the theory and applica¬ 
tions of classifier design and clustering al¬ 
gorithms using pattern recognition tech¬ 
niques based on fuzzy models (Systems, 
Man, and Cybernetics). 

Michael W. Blasgen, Yorktown Heights, 
New York, for leadership in the research 
of RISC architectures and relational data¬ 
bases (Computer). 

Barry W. Boehm, Arlington, Virginia, for 
contributions to the architecture, model¬ 
ing, and implementation of software envi¬ 
ronments (Computer). 

John R. Brews, Tucson, Arizona, for con¬ 
tributions to the theory of semiconductor 
devices, particularly the MOSFET (Elec¬ 
tron Devices). 

Ramalingam Chellappa, College Park, 
Maryland, for contributions to statistical 
model-based approaches for two-dimen¬ 
sional spectral analysis and image process¬ 
ing (Signal Processing). 

Joe Hong Chow, Scotia, New York, for 
contributions to singular perturbation the¬ 
ory and its application to control and pow¬ 
er systems (Control Systems). 

Paolo Corona, Naples, Italy, for the devel¬ 
opment of continuous-mode stirred cham¬ 
bers, and their application in electromag¬ 
netic compatibility evaluations (Electro¬ 
magnetic Compatibility). 

J. Ronald Cricchi, Catonsville, Maryland, 
for contributions to the development of 
semiconductor-device technology, particu¬ 


larly nonvolatile semiconductor memory 
(Electron Devices). .* 

Richard P. Flam, Horsham, Pennsylvania, 
for contributions to the design and applica¬ 
tion of broadband antennas (Antennas and 
Propagation). 

Warren D. Grobman, Yorktown Heights, 
New York, for contributions to electron- 
beam and X-ray microlithography and 
their practical applications (Electron De¬ 
vices). 

Narendra K. Gupta, Santa Clara, Califor¬ 
nia, for entrepreneurship and leadership in 
the development of software and hardware 
products for the design and implementa¬ 
tion of real-time control systems (Control 
Systems). 

Tatsuo Higuchi, Sendai, Japan, for contri¬ 
butions to the theory of multidimensional 
signal processing and the development of 
the beyond-binary VLSI signal processing 
technique (Circuits and Systems). 

Takamasa Hori, Mie-ken, Japan, for con¬ 
tributions to the development of speed- 
control systems for industrial alternating- 
current motors (Industrial Electronics). 

Oscar H. Ibarra, Santa Barbara, California, 
for contributions to the design and analysis 
of algorithms, the theory of computation, 
computational complexity, and parallel 
computing (Computer). 

Hideki Imai, Yokohama, Japan, for contri¬ 
butions to the theory of coded modulation 
and two-dimensional codes (Information 
Theory). 

Lawrence D. Jacket. Holmdel, New Jersey, 
for leadership in the application of neural 
networks to pattern recognition and in the 
development of electronic systems imple¬ 
menting neural networks (Neural Net- 

Ramesh C. Jain, Ann Arbor, Michigan, for 
contributions to computer vision in the es¬ 
timation of structure and motion from im¬ 
ages (Computer). 

Raymond A. Jarvis, Victoria, Australia, 
for leadership in and contributions to the 
fields of computer vision and intelligent ro¬ 
botics (Robotics and Automation). 

Takeo Kanade, Pittsburgh, Pennsylvania, 
for contributions to vision, manipulators, 
autonomous mobile robots, and sensors 
(Robotics and Automation). 

Abraham Kandel, Tallahassee, Florida, for 
contributions to fuzzy-set theory and its 
dissemination, and to the application of 
uncertainty management techniques to en¬ 
gineering systems (Systems, Man, and Cy¬ 
bernetics). 

Charles R. Kime, Madison, Wisconsin, for 
contributions to system fault diagnosis and 
VLSI testing (Computer). 


Donald E. Kirk, San Jose, California, for 
contributions to signals and systems educa¬ 
tion (Education). 

Der-Tsai Lee, Evanston, Illinois, for con¬ 
tributions to computational geometry and 
the design and analysis of algorithms 
(Computer). 

Victor On-Kwok Li, Los Angeles, Califor¬ 
nia, for contributions to the design and 
analysis of communications networks 
(Communications). 

Ren-Chyuan Luo, Raleigh, North Caroli¬ 
na, for contributions to intelligent sensors 
and robotics (Industrial Electronics). 

John S. MacDonald, Vancouver, Canada, 
for leadership in the development of tech¬ 
niques in terrestrial monitoring (Geo¬ 
science and Remote Sensing). 

A. Frank Mayadas, Yorktown Heights, 

New York, for contributions to magnetic 
storage and thin-film technology and sys¬ 
tems (Magnetics). 

James McKenna. Morristown, New Jersey, 
for contributions to the mathematical mod¬ 
eling of communication and computer sys¬ 
tems with queueing networks (Computer). 

Jayadev Misra, Austin, Texas, for contri¬ 
bution to the algorithms and theory of dis¬ 
tributed computing (Computer). 

Amr M. Mohsen, Saratoga, California, for 
contributions to integrated-circuit technol¬ 
ogies (Solid-State Circuits). 

Richard R. Muntz, Pacific Palisades, Cali¬ 
fornia, for contributions to the theory of 
queuing networks and computer system 
performance (Computer). 

Rajnikant V. Patel. Montreal, Canada, for 
contributions to the numerical analysis and 
design of control systems (Control Sys¬ 
tems). 

Lalit M. Patnaik, Bangalore, India, for 
contributions to distributed and parallel 
computing (Computer). 

Yale N. Patt. Ann Arbor, Michigan, for in¬ 
novative contributions to high-speed com¬ 
puter architecture (Computer). 

Keith S. Pennington, Yorktown Heights, 
New York, for contributions to nonimpact 
printing and image-processing technologies 
(Computer). 

Aristides A.G. Requicha, Los Angeles, 
California, for contributions to the theory 
and practice of three-dimensional geomet¬ 
ric modeling, and to its applications in pro¬ 
grammable automation (Robotics and Au¬ 
tomation). 

Juan A. Rodriguez, Boulder, Colorado, for 
technical leadership in the development of 
computer data-storage devices (Magnet- 
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John E. Savage, Providence, Rhode Island, 
for contributions to the theory of computa¬ 
tion and the design and analysis of VLSI 
algorithms (Computer). 

John P. Shen, Pittsburgh, Pennsylvania, for 
contributions to fault-tolerant computing 
and to VLSI circuit design (Computer). 

Louis C. Sheppard, Galveston, Texas, for 
contributions to biomedical engineering, 
especially in the area of closed-loop con¬ 
trol of blood pressure in patients following 
surgery (Engineering in Medicine and Bi¬ 
ology). 

Chintay Shih, Hsinchu, Taiwan, for contri¬ 
butions to semiconductor manufacturing 
technology, and for leadership in the 
growth of the semiconductor industry (En¬ 
gineering Management). 

Kang G. Shin, Ann Arbor, Michigan, for 
contributions to the theory of dynamic fail¬ 
ure in real-time computing systems (Com¬ 
puter). 

Barry G. Silverman, Washington, District 
of Columbia, for the introduction and ap¬ 
plication of intelligent decision-support 
systems that combine knowledge-based 
and decision-science techniques (Engineer¬ 
ing Management). 

James R. Slagle, Minneapolis, Minnesota, 
for contributions to artificial intelligence, 
especially computer symbolic mathematics, 
heuristic search, automated theorem prov¬ 
ing, knowledge representation, and expert 
systems (Computer). 

Lawrence Snyder, Seattle, Washington, for 
contributions to parallel computer pro¬ 
gramming, computer architecture, and the 
theoretical foundations of computer sci¬ 
ence (Computer). 

John D. Spragins, Clemson, South Caroli¬ 
na, for contributions to telecommunica¬ 
tions education and to performance and re¬ 
liability modeling of computer networks 
(Communications). 

Charles H. Stapper, Jericho, Vermont, for 
contributions to the science of defects and 
fault tolerance in VLSI circuits (Solid- 
State Circuits). 

Kunimaro Tanaka, Chiba, Japan, for con¬ 
tributions to digital audio recording tech¬ 
nology (Consumer Electronics). 

Willis J. Tompkins, Madison, Wisconsin, 
for contributions to biomedical engineer¬ 
ing education (Engineering in Medicine 
and Biology). 

Kishor S. Trivedi, Durham, North Caroli¬ 
na, for contributions to reliability and per¬ 
formance analysis of computer and com¬ 
munication systems (Computer). 

Edward C. Van der Meulen, Heverlee, 
Belgium, for contributions to the study of 
multiway channels in information theory 
(Information Theory). 


Joseph P. Vandewalle, Blanden, Belgium, 
for contributions to the mathematics of 
nonlinear circuits and systems (Control 
Systems). 

Jean-Jacques Werner, Middletown, New 
Jersey, for contributions to echo-cancella¬ 
tion techniques for full-duplex data com¬ 
munications (Communications). 


The Nominations Committee of the 
IEEE Computer Society is inviting 
suggestions from the membership for 
Board of Governors and officer nomi¬ 
nees to serve terms starting January 1, 
1993. Seven board seats will be filled 
during this fall’s membership election, 
with the top vote-getters winning 
three-year terms. 

Also in this year’s election, the 
membership will select first and sec¬ 
ond vice presidents, who will serve in 
1993, as well as the 1994 president, 
who will serve as president-elect in 
1993. 

The Nominations Committee must 
receive recommendations for Board 
of Governors and officer nominees by 
March 31,1992. The recommenda¬ 
tions must be accompanied by bio¬ 
graphical information and should in¬ 
clude facts about the nominee’s past 
and present participation in the soci¬ 
ety’s activities. These materials should 
be sent to Duncan H. Lawrie, c/o 
IEEE Computer Society, 1730 Massa¬ 
chusetts Ave. NW, Washington, DC 
20036-1903, e-mail d.lawrie@ 
compmail.com. 

Election timetable. The following 
timetable for nominations and elec¬ 
tions (for events following the com¬ 
mittee’s March 31,1992, nominations 
deadline) has been approved by the 
Board of Governors. 

May 8 — Nominations Committee 
sends its slate of officer and board 
candidates to the Board of Governors. 

May 22 — Last day for submission 
of board/officer petition candidates’ 
petitions to the society’s secretary, 
Ronald G. Hoelzeman, c/o IEEE 
Computer Society, 1730 Massachu¬ 
setts Ave. NW, Washington, DC 
20036-1903, e-mail hoelzeman@ee. 
pitt.edu, signed by members of the 
1992 Board of Governors. 

June 5 — Board and officer candi¬ 
dates selected at Board of Governors 
meeting in New Orleans. 

July — Computer publishes the 


Gio Wiederhold, Washington, District of 
Columbia, for contributions to database 
management and artificial intelligence 
technology applications (Computer). 


Chuan-lin Wu, Austin, Texas, for contribu¬ 
tions to the theory of interconnection net¬ 
works (Computer). 


board-approved slate of candidates 
and calls for additional petition candi¬ 
dates from the membership. (The pro¬ 
cess allowing members to nominate 
additional candidates by petition is 
described below.) 

July 3 — Position statements, biog¬ 
raphies, and photos of candidates se¬ 
lected by the Board of Governors are 
due at the society’s office in Los 
Alamitos, California. 

July 31 — Petitions with candidates’ 
position statements, biographies, and 
photos must be received by the soci¬ 
ety’s secretary, Ronald G. Hoelzeman 
(see address above). 

August 14 — Ballots are mailed to 
the membership. 

September — Computer publishes 
position statements, biographies, and 
photos of the candidates. 

October 26 — Last day for receipt 
of marked ballots. 

December — Computer publishes 
election results. 

Petitioning nominations. Petitions 
to add nominees to the list of candi¬ 
dates previously selected by the 
Board of Governors for the board or 
for officers must: 

• Set forth the office, the starting 
date of the office, and the name of 
the candidate. 

• Contain, for board nominees, the 
signatures of at least 250 voting 
members of the society, with each 
member eligible to sign only one 
Board of Governors’ petition and, 
for officer nominees, the signa¬ 
tures of at least 1,000 voting mem¬ 
bers of the society, with each 
member eligible to sign one peti¬ 
tion for each office. Each signa¬ 
ture must be accompanied by the 
printed name of the signer. To 
avoid ambiguous identification of 
each signer, it is recommended 
(although not required) that the 
signer’s membership number ac¬ 
company his or her signature. 


Computer Society announces call for nominees, 
1992 election timetable 
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• Be accompanied by a statement 
signed by the nominee that he or 
she is willing and available to 
serve, if elected. 

• Be received by Secretary Hoelze- 
man on or before July 31,1992. 

To qualify as a candidate for the 
board, a nominee must be a member 
of the society, must agree to seek sig¬ 
nificant involvement in society activi¬ 
ties such as chairing a committee or 
serving as editor-in-chief or society 
representative, etc., and must meet 
other requirements as stated in the 
constitution and bylaws. 

To qualify as a candidate for an of¬ 
ficer position, a nominee must hold a 
member grade or higher in the IEEE, 
must have been a Computer Society 
member for the preceding three years, 
and must meet other requirements as 
stated in the constitution and bylaws. 

Petitions can also be submitted to 
the secretary by Compmail. To be 
counted as a signature on a Compmail 
petition, each person signing the peti¬ 
tion by Compmail must originate a 
message stating his/her support of the 
nomination of the individual con¬ 
cerned and transmit it to Hoelzeman. 
His Compmail ID is r.hoelzeman@ 
compmail.com. 


CALL FOR 
PAPERS!!! 


Achieving A Quality 
Software Process 

The 2nd INTERNATIONAL 
CONFERENCE ON SOFTWARE 
QUALITY will be held in the Research 
Triangle Park, North Carolina on October 
5-7,1992.2ICSQ invites those interested in 
sharing their ideas, experiences, research, 
and/or lessons learned to submit an 
abstract, tutorial, or panel proposal. 
Please send three copies of your 
abstract and author biography by 
April 6,1992. 

2ICSQ will be held in coordination with 
the Third International Symposium on 
Software Reliability Engineering -1992. 


FOR MORE INFORMATION, PLEASE CONTACT: 


Sue McGrath, CQA 
SAS Institute Inc. 

SAS Campus Drive 
Cary, NC 27513 
(919)677-8000 
sassam@dev.sas.com 


John E. Lowe, SQA 
Litton Computer 
Services 

4020 Executive Drive 
Dayton, Ohio 45430 
(513)429-6458 


Please submit abstracts and biography to 
Sue McGrath at the address above. 


= Sponsored by: = - 

American Society for Quality ^ 
Software Division 


UPDATE 


Send news about research, public policy, or professional issues to Bob Carlson, 
Computer, 10662 Los Vaqueros Circle, P0 Box 3014, Los Alamitos, CA 90720-1264 


Rear Adm. Grace Hopper dies at 85 


Rear Admiral Grace Murray Hop¬ 
per, pioneer computer programmer 
and co-inventor of Cobol, died Janu¬ 
ary 1 at her home in Arlington, Vir¬ 
ginia. Known as the mother of com¬ 
puterized data automation in the US 
naval service, she retired from the 
Navy in 1986, having been the oldest 
military officer still on active duty. 
Her retirement ceremony was con¬ 
ducted aboard the USS Constitution, 
the service’s oldest commissioned 
warship. 

After her Navy career, she worked 
as a senior consultant at Digital 
Equipment Corp., focusing on govern¬ 
ment issues and participating in cor¬ 
porate educational programs. 

In September 1991, President 
George Bush awarded her the Nation¬ 
al Medal of Technology. The first 
woman to receive the award as an in¬ 
dividual, she was cited for “pioneering 
accomplishments in the development 
of computer programming languages 
that simplified computer technology 
and opened the door to a significantly 
larger universe of users.” 

Hopper earned a PhD in mathemat¬ 
ics from Yale University and began 
her career teaching math at her alma 
mater, Vassar College. Later, she 
worked as a mathematician at Eckert- 
Mauchly Corp., which was building 
the Univac I. 

She joined the US Naval Reserve in 
1943, where as a lieutenant assigned 
to the Bureau of Ordnance Computa¬ 
tion Project at Harvard University, 
she was a programmer on the first 
large-scale digital computer, the Mark 
I. It was here, the story goes, that the 
first computer “bug” was discovered. 
According to Hopper, a circuit mal¬ 
function was traced to the presence of 
a large moth, and from that time for¬ 
ward computer problems of all sorts 
were blamed on “bugs.” 

Though she left the Navy in 1946, 
she remained at Harvard as a faculty 
member in the computation laborato¬ 
ry, working on the Mark I and Mark 
II Navy computers and continuing as 
an active-duty reservist. 

The service’s age limitation led to 
her retirement from the Naval Re¬ 
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serve in 1966, but “Amazing Grace,” 
as she was known, was soon recalled 
to full-time active duty to standardize 
the Navy’s various computer languag¬ 
es. Assigned to the Naval Data Auto¬ 
mation Command in Washington, 
D.C., she continued her work in com¬ 
puter language techniques. 

Hopper received the IEEE Com¬ 
puter Society’s 1979 W. Wallace Mc¬ 
Dowell Award for her “combination 
of technical skill, leadership, teaching 
capability, and single-minded drive 
for the invention, adoption, and stan¬ 
dardization of high-level program¬ 
ming languages.” 

Also among her awards is the De¬ 
fense Department’s highest honor, the 
Defense Distinguished Service Medal 
for exceptional meritorious service. 

In addition, she received many hon¬ 
orary degrees and was recipient of the 
first Computer Sciences “Man of the 
Year” award given by the Data Pro¬ 
cessing Management Association. Her 
many accomplishments, honors, and 
appointments are listed in “Who’s 
Who.” 

Hopper was a life fellow of the 
IEEE and a longtime member of the 
IEEE Computer Society. 
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PRODUCT REVIEWS 


Editor: Richard Eckhouse, University of Massachusetts-Boston. 
Send review submissions to MOCO Inc., 776A Country Way, N. Scituate, MA 02066; Compmail r. eckhouse; Internet, eckhouse@cs.umb.edu. 


Unix hardware and software 

T.L. (Frank) Pappas, Aweb Software Technology 


When I got my first PC, I was really 
happy that I didn’t have to depend on 
a mainframe any more. I didn’t have 
to worry as much about such things as 
down time and system load. However, 

I grew tired of wasting time waiting 
for my text-processing software and 
compilers, so I moved on to a 386 and 
installed Unix. Once my Unix vendor 
added X Windows support and a great 
DOS emulator, my productivity really 
increased as I moved further away 
from DOS. 

Still, even with a 33-MHz from 
Arche Technologies, I needed more 
power. However, I wasn’t expecting 
the enormous boost in productivity I 
got from Sun Microsystem’s 25-MHz 
Sparcstation IPC desktop computer. 

Since a great deal has already been 
written about Unix and Sparc systems, 
I won’t get into them here. Instead, I 
want to give users — especially DOS 
users — an idea of the benefits of us¬ 
ing the IPC. Experienced Unix users 
may also be interested in some of my 
experiences with the IPC and X termi¬ 
nals. 

The system unit, which is 9.5 x 10.5 
inches, and less that 5 inches high, in¬ 
cludes a 3.5-inch high-density floppy 
and a 208-Mbyte hard disk. The 16- 
inch color monitor, which I have sit¬ 
ting on top of the unit, has a 1,152 x 
900 resolution. The monitor, including 
the tilt and swivel base, is 16 x 17 
inches and 16 inches high. All monitor 
controls are up front where they are 
easy to get to. With 8 Mbytes of 
RAM, this standard unit sells for 
$7,995. It comes with Sun OS (version 
4.1.1 on my system), a C compiler, lots 
of utility programs, the X-based 
OpenWindows (the one I used), and 
the Solaris environment, which con¬ 
tains the Open Network Computing/ 
Network File System. 

My unit has 24 Mbytes of RAM, 
which you’ll need to run some of the 
memory-intensive development or 
desktop publishing systems. It also has 
a CD-ROM player and an external 


unit with a 669-Mbyte hard drive and 
a cartridge tape drive. The CD-ROM 
unit is about the same size as the sys¬ 
tem unit but not quite so tall. The ex¬ 
ternal unit is 7.5 x 20.5 inches and 
measures 9.5 inches in height. With 
the CD-ROM player sitting on the ex¬ 
ternal unit, this configuration takes up 
about the same space as my 386 with 
the monitor. 

One of the first things I realized 
when I started using OpenWindows is 
just how restrictive Microsoft Win¬ 
dows is to work with. With OpenWin¬ 
dows, I don’t have to worry about ill- 
behaved programs, setting task 
priorities, creating a PIF file, or deal¬ 
ing with any other annoying features 
of Windows and DOS. 

Programs written for X and Open- 
Windows easily coexist with programs 
that know nothing about windowing 
systems. The latter start up in their 
own command-tool window. This win¬ 
dow is just a c-shell window with 
scrolling that maintains a scrollable 
history of all command-line activity. I 
found this feature useful when I was 
installing public domain software. 
When I entered the wrong value for 
an environment variable, the installa¬ 
tion failed after several screensful of 
data were output in the command 
window. I just scrolled back through 
the window, found the environment 
variable, copied it to the command 
line, made the necessary changes, and 
hit the Enter key to reset the variable. 
Of course, you can also copy and 
paste or cut and paste from one win¬ 
dow to another. 

One product I wanted to include in 
this review was DOSWindows, which 
provides DOS emulation. Sun de¬ 
clined to provide me with a copy be¬ 
cause it will be introducing a greatly 
enhanced version of DOSWindows 
about the same time this review will 
appear. I hope to look at DOSWin¬ 
dows in a future issue, but I wanted 
to let DOS users know that by mov¬ 
ing to an IPC, they may be able to 


get the best of both Unix and DOS 
worlds. 

Many popular DOS applications are 
now available under Sun Unix. What 
this means for DOS users is that they 
can migrate to the IPC, save some of 
their investment in DOS programs, 
and upgrade to Unix versions as the 
need arises. 

For example, I tried out versions of 
CorelDraw and WordPerfect on the 
IPC and found I liked them better 
than their DOS counterparts due to 
the noticeably better 1,152 x 900 dis¬ 
play and a much faster response. They 
were also well integrated into the 
OpenWindows environment. I could 
easily size and move the applications 
windows and switch to other windows 
much more effectively than I could 
under Microsoft Windows. 

Sun Unix offers many applications 
not available under DOS. For exam¬ 
ple, I tried the FrameMaker document 
publishing system from Frame Tech¬ 
nology, which rivals the best DOS 
publishing systems. I also tried Cen- 
terLine Software’s ObjectCenter, cur¬ 
rently the top of the line in C++ de¬ 
velopment environments. Anyone 
doing serious C++ development work 
should take a good look at Object- 
Center. It offers a graphical develop¬ 
ment environment with a C++ inter¬ 
preter and an assortment of browsers 
that significantly increase C++ devel¬ 
opment productivity. 

One product I really like is Sun’s 
AnswerBook. This OpenWindows- 
based hypertext system makes the 
standard Sun operating system manu¬ 
al available on line. Note, however, 
that loading all of AnswerBook onto a 
hard disk eats up a whopping 240 
Mbytes. Sun recommends this for en¬ 
vironments where AnswerBook will 
be used extensively by several sys¬ 
tems. Instead, I used an installation 
option that keeps the bulk of Answer- 
Book on the CD-ROM it comes on, 
loading only the databases, search in¬ 
dexes, and a few other files. This op- 
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tion, which Sun says is fine for light 
use by several systems or medium use 
on one system, requires 50 Mbytes of 
disk space. 

I used the Unix man program when 
I needed a quick overview of a com¬ 
mand and AnswerBook when I need¬ 
ed more extensive information or 
wanted to quickly search through sev¬ 
eral manuals. I only looked at the pa¬ 
per versions when I wanted to do a 
good deal of reading. 

I also connected five different com¬ 
panies’ X terminals to the IPC, one at 
a time. While all of them worked with 
the IPC, some worked better than 
others. My clear favorite was a 19- 
inch color X terminal from Hewlett- 
Packard. If you want a top-of-the-line 
X terminal, that’s the one to get. 

On the other hand, I like the idea of 
using the IPC instead. It outper¬ 
formed the X terminals on the bench 
tests that I ran. I don’t know how 
Sun’s 19-inch color monitor would 
stack up against the HP X terminal, 
but the IPC with the 16-inch monitor 
provides the features you need, along 
with local processing and a hard disk. 
Even without the local disk, the IPC 
would be a good X terminal, but with 
the disk it is the kind of X terminal 
that power users need. 

Readers may contact Sun Micro¬ 
systems at 2550 Garcia Ave., Moun¬ 
tain View, CA 94043-1100; (800) 821- 
4643; or circle Reader Service Num¬ 
ber 21. 


Noncommercial software 


Normally this column reviews com¬ 
mercial products, but this month we 
pay attention to software available in 
source form — some of it for next to 
nothing. I looked at four different 
sources: the GNU software from the 
Free Software Foundation, the Unix 
Tex distribution distributed by the 
University of Washington, the STARS 
Foundation software distributed by 
ASSET, and the Simtel 20 Ada repos¬ 
itory distributed by the Grebyn Corp. 

GNU software. GNU (the G in 
GNU is not silent), which stands for 
GNU’s Not Unix, is an effort by the 
Free Software Foundation to produce 
a Unix-like system that can be freely 
distributed. In fact, that’s what the 
free in the foundation’s name means; 
everyone is free to use and distribute 
the software. 

GNU software is copyrighted to en¬ 
sure that this freedom remains unvio¬ 


lated. Specifically, this means you can 
give the binary or Source version of 
any GNU product to anyone. Using 
GNU software in your own product 
makes your software free as well. You 
can sell the binary, but you also have 
to make the source available for a rea¬ 
sonable cost. 

As part of the Unix development 
effort, the Free Software Foundation 
has developed a host of Unix tools 
and other software, some of which has 
been ported to other systems, includ¬ 
ing MS DOS. Some of the software 
was written by the FSF itself, but a 
good part of it was written by volun¬ 
teers, some on loan from major corpo¬ 
rations. The FSF, which is a nonprofit 
organization, survives from contribu¬ 
tions, donations of equipment, and 
the distribution of their software and 
manuals. 

Let me emphasize that this software 
has not been thrown together. Many 
companies take advantage of GNU 
software because of its reliability. Of 
course, one good things about the 
GNU effort is that having the source 
lets you add features. You can even 
send the changes to the FSF, which 
might include them in future releases 
if they are of general use. 

If you have Internet access, you can 
use the anonymous file-transfer proto¬ 
col (FTP) to get the software yourself 
from prep.ai.mit.edu. You can get 
more information by obtaining the 
file /pub/gnu/GETTING.GNU. 
SOFTWARE. However, since it 
comes with a great deal of GNU soft¬ 
ware, you might be better off getting 
it directly from the FSF and then up¬ 
dating from Internet. 

Before telling you about the GNU 
programs that I installed, let me de¬ 
scribe what GNU offers. You can get 
four tapes from the FSF, with prices 
varying from $195 to $230 each, de¬ 
pending on whether you want 9-track 
or cartridge tape. The GNU Emacs 
tape contains the GNU Emacs editor, 
one of the most powerful editors ever 
written. It also includes the GNU de¬ 
bugger, Emacs Lisp, Scheme, com¬ 
pression software, and GNU Chess 
(which I’m told has a high chess rat¬ 
ing)- 

The GNU C compiler tape also in¬ 
cludes a portable optimizing compiler, 
the GNU debugger and assembler, Bi¬ 
son and Flex (free versions of yacc 
and lex, respectively), Bash (for 
Bourne Again Shell, a version of the 
Unix sh shell with extensions found in 
the csh and ksh shells), the GNU awk 
processor, a revision control system, 
free versions of tar, diff, grep, and 
nroff, Ghostscript (a PostScript-com¬ 


patible language processor), and gnu- 
plot (an interactive program for plot¬ 
ting mathematical expressions and 
data). 

There are also two MIT X Window 
System V11R5 tapes. One tape in¬ 
cludes the required X software and 
contributed client software. The other 
tape contains optional contributed 
software including libraries, games, 
Andrew, and other toolkits. 

All tapes include documentation for 
the products, many of which are in 
Tex or nroff format. Printed copies of 
the documentation are available for a 
modest price. Now to the GNU prod¬ 
ucts I actually installed. 

GNU Emacs. Richard Stallman, 
president of the Free Software Foun¬ 
dation, wrote the first Emacs editor in 
1975 while at MIT. GNU Emacs, his 
second implementation of Emacs, is in 
widespread use. Hosted on dozens of 
machines, Emacs is an easily extensi¬ 
ble editor written in a combination of 
C and ELisp, the Emacs Lisp inter¬ 
preter. 

Many readers are probably already 
familiar with other Emacs editors, so I 
will just highlight a few main features 
for those of you who are not. You can 
adjust Emacs by using ELisp to modi¬ 
fy existing commands or to write new 
ones. You can also adjust Emacs by 
just changing the settings of Emacs 
variables. 

The powerful on-line help system 
makes it easy to find out about com¬ 
mands, variables, or key bindings. The 
apropos help feature lets you ask for 
all commands related to a particular 
editing feature, say files. The entire 
Emacs manual is available on line 
through Info, a program used for 
browsing structured documents. 

Emacs supports multiple buffers 
used to support editing of multiple 
files. Multiple windows are provided 
for viewing the files. Buffers are also 
used to display Unix directories. A di¬ 
rectory editor makes it easy to delete 
or visit files in the directory or to 
move through the directory structure. 
For text editing, Emacs has very pow¬ 
erful search-and-replace features, in¬ 
cluding a full regular expression capa¬ 
bility. 

For software development, Emacs 
provides features for calling compilers 
or other external programs. It also has 
various editing modes for program¬ 
ming in ELisp, C, and Fortran, or for 
writing documentation in nroff or 
Tex. Public domain files are available 
for adding an Ada mode as well. 

Installing Emacs was pretty easy. 

All it took was a few changes to some 
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configuration files, and it was up and 
running. I used the GNU C compiler 
to compile the C code. 

GNU Emacs gnuplot and calcula¬ 
tor. Two really good programs that I 
also installed as part of Emacs were 
gnuplot and calc. The former is an in¬ 
teractive program for plotting mathe¬ 
matical expressions and data. It is on 
the compiler tape. Calc is a calculator 
mode for Emacs. It is available 
through anonymous FTP. 

Saying that calc is a calculator is 
like saying that a Rolls Royce is a car. 
Not only does calc provide standard 
calculator features, but you can write 
your own mathematical functions, 
evaluate them, solve systems of simul¬ 
taneous equations, and take deriva¬ 
tives. You can even have calc call gnu¬ 
plot to plot expressions or data in a 
pop-up window. 

If having a powerful, extensible edi¬ 
tor is not enough reason for having 
GNU Emacs, calc and gnuplot are — 
particularly if you perform mathemat¬ 
ical analysis. 

GNU C compiler. Many companies 
and universities use this robust opti¬ 
mizing compiler for ANSI C. I com¬ 
piled Emacs and the Unix Tex distri¬ 
bution using GNU C, and had no 
problems with it. In fact, since my ver¬ 
sion of Sun OS has two X libraries 
that I needed, I compiled them using 
GNU C. Although I didn’t try it, there 
is a C++ front end for GNU C, as well 
as a C++ library. 

Installing GNU C wasn’t very diffi¬ 
cult, again just requiring a few adjust¬ 
ments to some make files. I used the 
native Sun C compiler to bootstrap 
the compiler and then recompiled 
GNU C with the resulting compiler. 
Finally, I used GNU C to recompile 
itself one more time and then did a 
byte comparison on corresponding 
object modules from the last two 
builds to make sure the compiler was 
producing consistent results. GNU C 
make files provided scripts for doing 
all of this. It took about an hour to 
compile GNU C three times and com¬ 
pare the object modules. 

The Free Software Foundation is 
working on a major new release of the 
C compiler and C++ front end. The 
compiler will also support Objective 
C. There’s no definite date for the re¬ 
lease, but it may be ready by the time 
you read this review. 

By the way, Next, which donated 
the Objective C front end, used GNU 
C to build its entire system, including 
its port of NSF and Carnegie Mellon 
University’s Mach kernel. The Open 


Software Foundation uses GNU C as 
the compiler in its operating system. 
Data General uses it for the Aviion 
88000-based workstation, Intel uses it 
for the 960 microprocessor, and Com¬ 
modore-Amiga uses it for Amiga 
Unix. 

Readers can contact the Free Soft¬ 
ware Foundation at 675 Massachusetts 
Ave., Cambridge, MA 02139; (617) 
876-3296; e-mail: gnu@prep.ai.mit. 
edu; or circle Reader Service Number 
22 . 

The Unix Tex distribution. The two 

other times I reviewed the Tex (fre¬ 
quently printed as “TgX”) document 
preparation system in this column, I 
examined commercial versions on the 
IBM PC. When I decided to look at 
the Unix Distribution of Tex from the 
University of Washington, I wondered 
how different it could be — and 
whether it was really worth the effort 
of reviewing it. I was really surprised 
to find how much better this public 
domain distribution is than the com¬ 
mercial versions. I expected that it 
would take considerably less time to 
format documents, and I was right. 
What I didn’t expect was the high- 
quality Tex and support programs. 

For those of you who are not up on 
Tex, when you process a Tex file, it 
produces a dvi (device independent) 
file. To print the formatted document, 
you need another program, called a 
printer driver, that outputs the dvi file 
to your particular printer. If you want 
to view the file, you need a driver that 
allows you to preview the file on your 
display device. 

The Unix Tex distribution includes 
drivers for several printers, including 
a generic PostScript printer, although 
it does not contain an HP LaserJet 
driver. The PostScript driver, dviPS, 
provides most of what you need, in¬ 
cluding PostScript graphics. 

The dviselect utility allows you to 
select which pages you want to print. I 
prefer to create double-sided copies 
of documents and papers because they 
are easier to use and also save paper. 
The utility dviselect provides this op¬ 
tion, which none of the PC versions of 
Tex have. In addition to printing page 
ranges, you can specify which chapters 
you want printed. 

The previewer, xdvi, is an easy-to- 
use X-based previewer. Anyone who 
normally uses a PC previewer will be 
amazed at how fast xdvi loads a dvi 
file and displays a page. 

Tex and its font generator, 
Metafont, are written in Pascal. The 
Unix Tex distribution includes a Pas- 
cal-to-C translator. I compiled the re¬ 


sulting C files with GNU C. It was 
more difficult to configure Tex, 
Metafont, and the drivers because dif¬ 
ferent sites like to use different con¬ 
ventions for Tex’s font directory 
structure. But after a few false starts, I 
was able to create the programs. 

The Unix Tex distribution is re¬ 
markably well organized, especially 
since it is maintained by a volunteer 
staff and incorporates programs writ¬ 
ten by a variety of programmers, some 
of whom think they need to write very 
little documentation on how to config¬ 
ure and compile their programs. The 
volunteer staff at the University of 
Washington has done a good job of 
adding tips and clarifications. 

Actually, saying that Tex and 
Metafont are written in Pascal is not 
quite correct. They are actually writ¬ 
ten in a language called Web, which 
combines Pascal and Tex and allows 
the documentation of a program and 
its Pascal code to be combined. Two 
programs, Weave and Tangle, pro¬ 
duce Tex and Pascal source files, re¬ 
spectively. The Unix Tex distribution 
also includes CWeb, a C version of 
Web. 

I was pleasantly surprised that Spi¬ 
derweb was included in the distribu¬ 
tion. Developed by Norman Ramsey 
at Princeton University, Spiderweb is 
a Web language generator. When you 
enter a grammar specification for a 
language. Spiderweb generates a Web 
for that language. The current version 
of Spiderweb includes grammars for 
Ada, awk, C, and C++. 

Unix Tex is available for $75 from 
the University of Washington. This 
pays for the tape you receive and 
helps with supplies and costs involved 
in maintaining the distribution. 

Readers may contact Tex Support 
at the Northwest Computing Support 
Center, DR-10, University of Wash¬ 
ington, Seattle, WA 98195; (206) 546- 
6259; e-mail: mackay@cs.washington. 
edu; or circle Reader Service Number 
23. 

STARS Foundation Software. The 

STARS (Software Technology for 
Adaptable Reliable Systems) Founda¬ 
tion, under the auspices of the Naval 
Research Laboratory (NRL), devel¬ 
oped reusable Ada building blocks in 
key technology areas. The work in¬ 
cluded 33 completed contracts. Partic¬ 
ipants include the Stanford Research 
Institute, Lockheed, Grumman Data 
Systems, Computer Sciences Corp., 
SAIC, Unisys, Westinghouse, BDM, 
General Dynamics, ITT, and Ford 
Aeronutronic Division. 

Some of the reusable components 
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Noncommercial reading 

I came upon four books that you might find useful to research noncommercial 
software. 

C. A. Finseth’s, The Craft of Text Editing: Emacs for the Modern World, Spring- 
er-Verlag, New York, 1991, covers the basics of writing an Emacs editor. It should 
be useful for anyone who wants to implement an Emacs-like editor or is planning 
on maintaining GNU Emacs. In spite of a few inane comments about Ada and text 
editing, the book is worth reading. 

D. Cameron and B. Rosenblatt’s Learning GNU Emacs, O’Reilly & Associates, 
Sebastopol, California, 1991, is an outstanding introduction to GNU Emacs for 
newcomers. An especially good chapter on using GNU Emacs under X explains 
how to create pop-up menus and attach them to mouse clicks. I also recommend 
the Free Software Foundation’s GNU Emacs manual as follow-up reading for us¬ 
ers who want to get the most out of Emacs. 

S.V. Bechtolsheim’s Tex in Practice, Springer-Verlag, New York, 1992, is a 
four-volume, 1,600-page treatise for serious Tex users who want to maximize the 
program. As much as I like D. Knuth’s The TexBook, it is difficult to completely 
learn Tex from it unless you can devote a great deal of concentrated time to the 
process, something that I don’t have the luxury of doing. (I also think that’s why 
Tex has never been as popular as it should have been.) However, Tex in Practice 
should go a long way toward increasing Tex’s popularity and unlocking its se¬ 
crets. 

M.B. Feldman and E.B. Koffman’s Ada Problem Solving and Program Design, 
Addison-Wesley, Reading, Massachusetts, 1991, is “inspired by, and borrows 
much material from,” Koffman’s popular Problem Solving and Structured Pro¬ 
gramming with Pascal and Problem Solving and Structured Programming with 
Modula-2. However, much of the material is new, and the presentation order has 
been changed. Since the book is a CS1-oriented introduction to Ada that requires 
no previous programming experience, it is ideal for a first semester introduction to 
programming. It comes with a disk containing the source code for text examples. 

- T.L. (Frank) Pappas, Aweb Software Technology 


include Ada bindings to SQL, CICS, 

X Windows, and GKS. There are sev¬ 
eral Ada components to help in test¬ 
ing and debugging: an Ada/SQL test 
data generator, an embedded system 
debugger, and a universal test lan¬ 
guage for testing complex systems. 
Some components help to support 
Ada runtime environments: a tool to 
manage tailored support environ¬ 
ments; distributed support; and servic¬ 
es for complex, time-critical embed¬ 
ded applications. 

Some of the components facilitate 
software development: 

• an interactive, object-oriented 
command language for Ada; 

• a user-configurable, tailorable text 
editor with syntax-directed editing 
for Ada and SGML; 

• a configuration management assis¬ 
tant for Ada; 

• a reusability library framework for 
creating and maintaining reposito¬ 
ries of reusable components; 

• a graphical design tool for interac¬ 
tively creating object-oriented de¬ 
sign diagrams and translating 
them into compilable Ada; and 

• a rapid storage/retrieval of reus¬ 
able components. 

In addition to a layered, multilevel 
secure DBMS prototype and a secure 
FTP implementation, there is also an 
implementation of SGML and a pars¬ 
er generator for Ada code. 

There are even components that 
some people think can’t be written in 
Ada: components for optimization 
and planning, such as the SCT terrain- 
masking algorithm, dynamic program¬ 
ming, and the Dijkstra shortest path 
algorithm; components for image pro¬ 
cessing; and components for pattern 
recognition. 

The quality of documentation could 
be better. Some companies did a good 
job of writing manuals and reports de¬ 
scribing the components, while others 
did not. That really surprised me be¬ 
cause the components were developed 
under government contract. Although 
I don’t particularly like the Depart¬ 
ment of Defense documentation stan¬ 
dards, government contracts usually 
do require extensive documentation. 
The other thing that surprises me is 
that companies would let software go 
without documentation. Since the 
STARS components are widely dis¬ 
tributed, I would think these compa¬ 
nies would be worried about how 
poorly it reflects on them — but then 
some people don’t seem to care about 
quality. 

One other observation is that the 


components can be very useful as 
building blocks. If you are doing any 
serious work in Ada, you should get 
these components. 

Originally, the STARS components 
cost $2,000, but they are now free in 
order to promote their use. When I 
first obtained them, all I had to do 
was send two 1/4-inch cartridge tapes. 
They were returned with the compo¬ 
nents. However, since the organiza¬ 
tion that was distributing the compo¬ 
nents has now turned over its 
responsibility to ASSET, more op¬ 
tions and components are available. 

The ASSET (Asset Source for Soft¬ 
ware Engineering Technology) library 
is funded by the STARS program, 
with IBM as the prime contractor. In 
addition to the 33 assets (groupings of 
related files) of the STARS founda¬ 
tion software, the ASSET library con¬ 
tains another 119. There is also a 1- 
Gbyte component backlog waiting to 
be cataloged. 

The ASSET library is accessible 
through Internet and dial-up lines. 
Readers may contact ASSET at 2611 
Cranberry Sq., Morgantown, WV 
26505; (304) 594-3954; or circle Read¬ 
er Service Number 24. 


Review notes 

GammaFax CP. My fax use is gen¬ 
erally limited to receiving proofs of 
reviews and articles, or receiving in¬ 
formation from vendors. I never really 
cared for the output quality that a fax 
produces, so I rarely used it in my 
business. Since I don’t have a separate 
phone line for my fax use, a fax ma¬ 
chine doesn’t make sense. And, since 
the Quadram Fax card and software 
that I was using never struck me as 
friendly, sending or receiving a fax 
was always too much trouble. 

The GammaFax CP card from 
GammaLink has changed my mind. 
The software is usually easy to use, 
and the on-board 10-MHz 80188 pro¬ 
cessor and 512-Kbyte RAM let me 
send or receive faxes in the back¬ 
ground without any impact on fore¬ 
ground processing. For example, 
while receiving a 24-page fax, I edited 
a Tex file, ran an extended memory 
version of Tex on the file, and pre¬ 
viewed the PostScript-generated doc¬ 
ument on my monitor, all without any 
noticeable slowdown. 

The GammaFax CP is the top of the 
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GammaFax line. The network-ready 
board is programmable and, according 
to the company, optimized for use as a 
fax switch in a heavy-duty network. 

At $1,095, it’s a little steep for individ¬ 
ual use, but the half-size GammaFax 
Xpress, which sells for $595, is more 
affordable with nearly identical capa¬ 
bility. It does, however, lack the CP’s 
autorouting of incoming faxes. 

The CP and Xpress can both be 
programmed using a script language. 
Support for the language is supplied 
with the package, but I had no imme¬ 
diate use for it. The CP has an option¬ 
al C-based programmer’s interface 
selling for $995, which can be used to 
build fax capabilities into applica¬ 
tions; but, again, I didn’t use it. 

The GammaFax CP would be best 
used in a fax network with the CP as a 
fax switch and the Xpress in local ma¬ 
chines. The press kit that came with 
the CP included a sample fax output 
of artwork created with Adobe Illus¬ 
trator. I asked the company to fax me 
a copy so that I could check the quali¬ 
ty produced by sending PostScript 
files from one GammaFax to another. 
The quality was as outstanding as the 
press kit sample. The quality is very 
good even when you are communicat¬ 
ing with a non-GammaFax board. 

The CP has more features than a 
fax machine. You can easily convert 
PCX, CUT, and Epson printer files to 
the type-3 TIFF files used by Gam¬ 
maFax. Or you can convert TIFF files 
to PCX or CUT formats, which allows 
you to include graphics and scanner 
output, and fax output wherever PCX 
or CUT files can be used. When you 
send a fax with non-TIFF input, the 
conversion to TIFF is performed on 
the CP card. 

You can print on a variety of print¬ 
ers, including Epson, HP LaserJet, 
and PostScript. You can also view a 
fax on your monitor, but I found the 
quality lacking. It’s evident that Gam- 
maFax’s expertise is in fax cards, not 
viewing programs. 

Printer support could use some 
work. I use an Okilaser 840 printer 
that can operate in either PostScript 
or HP LaserJet II mode. Since I was 
sharing it with a Sun Sparc worksta¬ 
tion through an A/B switch, I kept it 
configured as a serial PostScript print¬ 
er. While the GammaFax supports 
PostScript printers, it couldn’t handle 
the serial port. I was forced to output 
PostScript to a file and then print it 
using a utility that knows how to print 
PostScript on a serial port. The exces¬ 
sive printing time required for the 
huge PostScript files the CP created 
made this approach unrealistic for 


large faxes, so I removed the A/B 
switch and connected a parallel cable 
from the PC to the printer and a serial 
cable from the workstation to the 
printer. Now, when I want to print a 
fax, I switch to LaserJet emulation. 

A (generally) user-friendly menu 
and the Queue Manager make using 
the GammaFax CP fairly easy. The 
Queue Manager, which runs in the 
background and eats up 93 Kbytes, 
handles background fax operations. 
One annoying feature about the 
Queue Manager is that once loaded, it 
always answers the phone and can’t 
be unloaded. Although I get around 
this by loading the Queue Manager 
when I’m ready to use the fax and re¬ 
booting when done, I wish the compa¬ 
ny would do something about this. 

After plugging my phone line into 
one of the jacks in the back of the CP, 
I could send and receive a fax. I was 
impressed with how simple it is to set 
up the GammaFax CP until I picked 
up my phone. I could hear a dial tone, 
but at a much lower volume than usu¬ 
al. And I couldn’t break the dial tone 
by pressing a key. 

After unplugging the phone line 
from the CP, I called technical sup¬ 
port. The answer I got was incredible. 
Don’t get me wrong in what follows, 
because I really like this card and 
strongly recommend it or the Xpress 
to anyone who needs a good fax card, 
but the problem was the result of a 
decision. 

A standard phone line usually con¬ 
sists of four wires: red, green, yellow, 
and black. With a one-line system, red 
and green provide the voice line. 
Sometimes yellow and black are used 
for a second line or as a feed from a 
low-power transformer for phones 
with lighted dials. Currently, no inter¬ 
national standard exists for handling 
the yellow and black input. So, rather 
than using common sense and passing 
the input through unchanged, the 
company connected the wires. To get 
the CP to work with my phone, the 
company told me to cut the yellow 
and black wires leading into the card. 

Readers may contact GammaLink 
Graphics Communications at 133 Cas¬ 
pian Ct., Sunnyvale, CA 94089; (408) 
744-1549, fax (408) 744-1549; or circle 
Reader Service Number 25. 

— T.L. (Frank) Pappas, Aweb Soft¬ 
ware Technology 


OT5 760. Large-capacity SCSI hard 
disks, like this one from Orcha Tech¬ 
nology, provide lots of obvious bene¬ 


fits, especially with DOS 5.0’s capabil¬ 
ity for creating large partitions. How¬ 
ever, as I found out, there are poten¬ 
tial problems with SCSI adapter 
compatibility. If you plan on buying a 
SCSI disk, make sure you buy one 
that is compatible with the hardware 
and software you will be using. 

The OT5 760 is a full-height SCSI 
drive, with an unformatted capacity of 
765 Mbytes. After formatting with 
Adaptec’s AHA-1542B SCSI adapter, 
the usable capacity is 640 Mbytes, as 
reported by FDISK. When you decide 
on a large disk, remember that there 
will be a significant difference be¬ 
tween the formatted and unformatted 
capacities. 

I used the Orca/Adaptec combina¬ 
tion in a 33-MHz Arche Legacy 386.1 
ran Norton Utilities benchmarks with 
the Legacy cache and fast-disk fea¬ 
tures turned on. The disk speed test 
reported a 10.1 disk index, compared 
to the 8.4 benchmark for a 33-MHz 
Compaq 386. The overall system in¬ 
dex was 27.3 compared to 25.9 for the 
Compaq. The average seek time was 
13.3 ms, compared to 14 ms in Orca’s 
literature. 

The suggested retail price of the 
Orca drive is $1,650, which doesn’t in¬ 
clude the SCSI cable or the adapter. 
With a mean time between failure of 
150,000 hours, your investment in the 
drive would seem secure. 

A quick check in the back of a PC 
magazine revealed that the price of a 
comparable drive with an average 
seek time ranging from 14.5 to 16 ms 
is about $1,390. However, since the 
OT5 760 is new, it’s possible that the 
cost could become more competitive. 

While I’m quite happy with the 
drive, the Always Technology IN-2000 
SCSI adapter Orca originally sent 
with the disk is another story. The 
adapter worked properly with DOS 
5.0, but when I tried it for version 1.1 
of Open Desktop, I had problems. 

First, the Always adapter comes 
with software drivers for Xenix and 
Novell, but I had to request one for 
Unix. Since SCO Open Desktop sup¬ 
ports only Adaptec and Western Digi¬ 
tal SCSI adapters, you must use an ex¬ 
isting SCO system to create a special 
boot disk for installing the SCSI disk. 
If you don’t have an existing system, 
you have to send your original (not a 
copy) Open Desktop boot disk and an 
empty one to Always and have them 
build the special boot disk for you. 
Since I had an existing system, I tried 
to build the boot disk. When this was 
unsuccessful, Always personnel told 
me it was a known problem that oc¬ 
curred on some systems, but they 
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hadn’t been able to figure out the 
cause. They wanted me to send the 
Unix boot floppy. 

In the meantime, for reasons I 
won’t go into here, Arche decided to 
replace the motherboard. Once this 
happened, the Always adapter usually 
wouldn’t boot. When I booted DOS 
with a floppy, the hard disk wouldn’t 
always be recognized. I couldn’t de¬ 
termine the cause of the problem. 

I called the Orca people and asked 
them to send the Adaptec controller, 
specifically asking for the one with the 
floppy controller. After installing this 
adapter right from the box, both DOS 
5.0 and Unix worked properly. 

The only problem I ran into then 
was that SCO Open Desktop’s DOS 
support isn’t truly compatible with 
DOS 5.0. SCO reports that many us¬ 
ers can use a DOS 5.0 partition from 
Open Desktop if the partition is no 
larger than 32 Mbytes and is installed 
by observing a few guidelines. Since 
the SCSI adapter maps the 16 heads 
of the Orca disk to 64 heads, one of 
these guidelines required me to leave 
a 32-Mbyte empty partition to get 
DOS 5.0 and Unix on the disk. Unfor¬ 
tunately, Open Desktop can’t seem to 
find the DOS partition, so I can’t ac¬ 
cess it from Open Desktop. 

Readers can contact Orca Technol¬ 
ogy Corp. at 1751 Fox Dr., San Jose, 
CA 95131; (408) 441-1111; or circle 
Reader Service Number 26. 

— T.L. (Frank) Pappas, Aweb Soft¬ 
ware Technology 


PackRat. Polaris Software’s Win¬ 
dows-based Personal Information 
Manager is for people who need or 
like PIMs. The product is well thought 
out, offering a very complete set of 
PIM functions with a clear and consis¬ 
tent user interface. 

PackRat lets you manage phone 
books, phone logs, to-do lists, index 
cards, indexable note pads, financial 
accounts, and personal calendars. The 
package also contains a comprehen¬ 
sive project manager with Gantt chart 
output and critical path analyses in 
which discrete items can be defined or 
items from other PIM functions can 
be “attached.” You can access any 
function, its lists, and list operations 
by icon-clicking from Windows pull¬ 
down menus, or directly from the key¬ 
board. List-management operations 
(like item addition, deletion, Boolean 
search, etc.) are consistent from func¬ 
tion to function. Despite a somewhat- 


ponderous, on-line and text-augmented 
tutorial, you can master the basic oper¬ 
ations of all functions fairly quickly. 

This versatile software lets users 
customize the display of icons and op¬ 
tions. In addition, calendars can be 
viewed in daily or weekly detail, or by 
a month at a glance. Gantt chart views 
are also available for date-related en¬ 
tries, which can be displayed any¬ 
where from hourly to yearly scale. 

You can also select your most com¬ 
monly used view of a calendar and/or 
Gantt chart as the default display. 

One of the more diverse features 
lets you selectively attach items from 
different function lists. This allows 
you to relate data across functions and to 
access otherwise unrelated information 
by using key or catalog searches. 

One of the reasons that I don’t use 
PIMs is that their benefits do not jus¬ 
tify the effort required to enter and 
maintain the data. I typically use the 
Windows Cardfile accessory to main¬ 
tain a name/address/notes/etc. data¬ 
base; I also keep a handwritten day 
timer schedule and phone book that I 
carry around with me. More than any¬ 
thing, I would like a Windows-based 
PIM that allows me to update a 
phone/schedule database and to print 
updated, pocket-sized inserts for my 
physical day timer. If I had those basic 
features, I might start to use the more 
sophisticated features offered by 
packages like PackRat. (PackRat also 
documents I/O file compatibility for 
the Sharp Wizard. I did not test that 
feature.) 

To this end, I tried to use the Pack¬ 
Rat import function for my already 
existing CRD-formatted card files. Of 
course, because of that application’s 
nonstandard, free-flowing text format, 

I was unsuccessful. The import func¬ 
tion recognizes (and generates upon 
export) comma-, column-, or tab-de- 
limited fields. The printing features of 
PackRat are more versatile, however, 
providing default output formats for 
all function displays, customized re¬ 
port formatting, and label and enve¬ 
lope printing. In addition, Version 4.0 
provides output templates for Rolo¬ 
dex cards and for standard Daytimer 
sizes. 

All in all, PackRat is quite a useful 
PIM for those who need this kind of 
application. Its interface is very versa¬ 
tile and well designed, and its search 
and display performance is excellent. 

If you are in the market for a pass- 
word-protectable single-user or multi¬ 
user networked personal information 
manager, PackRat is an excellent 
choice. 

Note that Polaris PackRat Version 


3.0 for Windows 3.0 will not run in 
RealMode. Suggested retail pricing is 
$395 with a networked version avail¬ 
able at $695 for three users and $150 
for each additional user. 

Readers can contact Polaris Soft¬ 
ware at 1820 S. Escondido Blvd., Ste. 
102, Escondido, CA 92025; (619)743- 
7800; or circle Reader Service Num¬ 
ber 27. 

— Sorel Reisman, California State 
University, Fullerton 


DisplayMate. Visual ergonomics 
has always been a concern within the 
computer industry. The DisplayMate 
Video Display Utilities from Sonera 
Technologies helps answer that con¬ 
cern by providing a mechanism for 
evaluating and adjusting the perfor¬ 
mance of the display system on a PC. 
DisplayMate provides a myriad of 
tests and procedures for video moni¬ 
tors and boards for IBM PC and com¬ 
patible computers. Users can run their 
display systems through the paces. 

The program has two modes, covering 
the nontechnical and technical user. 
Each mode has an extensive array of 
tests, mostly passive, that allow the 
user to visually inspect the quality and 
compatibility of any monitor and dis¬ 
play hardware combination. 

With DisplayMate, the nontechnical 
user can fine-tune the settings for the 
display system. During testing, the 
user can make choices and read sug¬ 
gestions that will help with the task of 
adjusting the settings on the display 
system. I ran the tests and followed 
the suggestions prior to writing this 
review, with impressive results. 

The technical user can employ Dis¬ 
playMate to diagnose display system 
problems, evaluate configurations, 
and perform setup procedures for op¬ 
timum system performance. The tests 
are more than adequate to evaluate 
all modes of performance for any giv¬ 
en configuration. 

The displays themselves are quite 
interesting and valuable. They pro¬ 
vide a mechanism for testing a com¬ 
prehensive set of criteria. Extensive 
color and shading tests provide a 
method for precise adjustment of the 
display, as well as pointing out serious 
flaws in the design. Another set of ex¬ 
tensive tests discerns the CRT timing, 
scan adjustments, coordinate precise¬ 
ness, and pixel sharpness. 

The mode test suites provide the 
level of compatibility for a display sys¬ 
tem with the requirements of the 


90 


COMPUTER 








MDA, CGA, EGA, MCGA, VGA, 
Hercules Monochrome, and Incolor 
standards. Since seven modes is a 
partial count of the available stan¬ 
dards, and more standards are in the 
evolution phase, compatibility is an 
important issue. 

DisplayMate is available for IBM 
PCs and compatibles at $79. 

Readers can contact Sonera Tech¬ 
nologies, PO Box 565, Rumson, NJ 
07760; (908) 747-6886: or circle Read¬ 
er Service Number 28. 

— Ed Gordon, Bdata Systems 


SourceProfiler. This addition to the 
Microsoft library evaluates executable 
programs, reporting time spent on 
each function or line, how many times 
functions are executed, and whether 
certain lines are executed. It works 
well as an optimal control tool during 
programming, as well as an evaluatory 
package for existing code. 

SourceProfiler evaluates a program 
as it runs and writes a report to an 
output file. Four status reports — 
counting, timing, coverage, and sam¬ 
pling — may be generated, separately 
or in one file. Counting returns the 
number of times a function or line is 
executed during a program, both in 
number count and percentage of cov¬ 
erage for each line or function. Tim¬ 
ing records how much time is spent 
with each function or line. Coverage 
records which lines have been execut¬ 
ed, and sampling periodically inter¬ 
rupts the program and records what 
functions or lines are being executed. 

Each status report can be generated 
for either whole functions or individu¬ 
al lines. As the user manual suggests, 
it is most beneficial to first profile a 
program by function, then focus on 
trouble spots and profile those areas 
by line. Counting and timing status 
profiles seem to be the most benefi¬ 
cial, particularly when used in con¬ 
junction with each other. Both give in- 
depth information about areas in the 
program that use the most runtime. 
Coverage information also becomes 
helpful in cases where program output 
is not as it should be, answering such 
questions as whether all the right lines 
are being executed or a loop is work¬ 
ing correctly. Profiling for function 
coverage and especially for line cover¬ 
age reduces the usual stop-and-go 
running of a program to see what 
might be happening. 

I found the sampling profiler fair. It 
seems to be less accurate than the oth¬ 


er evaluators, and I felt that it was less 
helpful in optimization and debugging. 

The only real drawbacks I found 
were related to the software pre¬ 
requisites, which were more a nui¬ 
sance than a serious worry. First, 
SourceProfiler works only with the 
latest versions of Microsoft languages 
(that is, Basic (6.0+), C (5.1+), Cobol 
(5.1+), Fortran (5.1+), QuickBasic 
(4.0+), Macro Assembler (5.1+), 
QuickC (2.0+), and QuickC with 
QuickAssembler (2.01+)). This, of 
course, would not be a problem to 
those already possessing these lan¬ 
guages. Second, programs must be 
compiled with debug information to 
be profiled. These two drawbacks 
only present a problem in profiling 
previously written programs that one 
may wish to optimize. 

The user guide suggests effective 
use of SourceProfiler to optimize code 
and ways to profile faster and more 
efficiently. SourceProfiler is a small 
package, but it is a good companion to 
a collection of Microsoft languages 
and proves useful in reducing time 
spent on fine-tuning code. The pack¬ 
age has a suggested retail price of $79. 

Readers can contact Microsoft Corp. 
at One Microsoft Way, Redmond, WA 
98052-6399; (206) 882-8080; or circle 
Reader Service Number 29. 

— Karin Feeney, MOCO Inc. 


Microsoft Mouse Software,Version 
8.1. This update lets you specify three 
cursor sizes (small, medium, and 
large) and allows the cursor to “grow” 
so that it changes from small to large 
as you move it across the screen. For 
those of you with LCD screens, both 
options are a reasonable solution to 
the cursor that vanishes when moved 
rapidly. 

Another interesting addition is the 
cursor’s color (normal, reverse, or 
transparent). Both features can be set 
in Windows by using the mouse con¬ 
trol panel. In plain DOS mode, you 
use a new Pointer command to set the 
size, color, growth, threshold, and de¬ 
lay (the last two affecting when the 
cursor grows). Pointer settings are 
kept in the MOUSE.INI file. 

Version 8.0 was the first to support 
Microsoft’s BallPoint mouse. Users 
who purchased that device will get the 
upgrade free if they registered their 
product. For the rest of us, the up¬ 
grade price is $25 plus shipping and 
handling. 

Readers can contact Microsoft at 


One Microsoft Way, Redmond, WA 
98052-6399; (800) 426-9400; or circle 
Reader Service Number 30. 

— Richard Eckhouse, University of 
Massachusetts-Boston 


CompuWisk. Static is a nasty little 
problem with computer monitors. It 
causes that crackling sound you hear 
when you turn your monitor off and 
the zaps when you touch the screen. 
We can all live with that, I suppose, 
but the real problem is that the static 
attracts dust. So we use our sleeves, 
our handkerchiefs, a paper towel, or 
anything handy to remove the dust, 
leaving streaks on the screen and get¬ 
ting zapped! 

An easy way to prevent all of this? 
CompuWisk. This $39.95 antistatic 
brush is a blend of natural hair and 
dissipative fibers that you use to keep 
your screen clear and discharged. The 
brush is 4 inches wide and 2.5 inches 
high, and comes with a 12-foot coiled 
cord that you attach to the case of 
your monitor or PC. A velcro dot lets 
you store the brush on the side of 
your monitor or at the edge of your 
keyboard. And yes, you can even use 
the brush to keep your keyboard 
clean, although it won’t work for 
mopping up coffee spills. 

I’ve been using this brush for two 
months, and I love it. Anytime I see 
dust on the screen, I wipe it down 
with the brush in just a few seconds. A 
quick shake to clear the brush (it can 
even be washed) and back it goes onto 
its hanger. I heartily recommend it. 

Readers can contact Kinetronics 
Corp. at PO Box 6178, Sarasota, FL 
34278-6178; (813) 388-2432; or circle 
Reader Service Number 31. 

— Richard Eckhouse, University of 
Massachusetts-Boston 



CompuWisk antistatic brush. 
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your submission by the same date to Susan Davidson at the following address: 
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NEW PRODUCTS 


News on the network 


Console management for distributed heterogeneous systems 


Digital Equipment Corp. has re¬ 
leased Version 1.4 of the VAXcluster 
Console System with a DECwindows 
interface for managing distributed, 
heterogeneous computing environ¬ 
ments. Features include an emulation 
package that runs console simulations 
without affecting operations, a scan- 
profile capability that expands the 


number of detected events, and addi¬ 
tional system-control functions on 
fault-tolerant VAXft platforms. 

The system manages consoles from 
devices that support RS-232 commu¬ 
nications, sends ASCII text, and sup¬ 
ports Xon/XOff and I/O buffering 
functions. It logs console data, 
searches text for character strings, 


notifies users of matching strings, and 
lets users connect managed devices. 
Users can also control devices in un¬ 
attended environments from a tele¬ 
phone on a modem. 

The system (available in April 
1992) costs $1,384 per connection. 


Reader Service Number 36 


Expanding the network 


Client-server application development 


The Novaport Asynch Expander 
from Novadyne Computer Systems 
connects up to 128 serial devices from 
an Sbus expansion slot. 

The expander is hardware-compati¬ 
ble with such Sun Sbus systems as the 
IPC/IPX; Sparcstations 1,1+, and 2; 
and the multiprocessor Sparcserver 
600 MP series. A loadable Streams- 
based Unix device driver ensures Sun 
OS compatibility. 

Software-programmable data rates 
of up to 38.4 Kbps with full duplex 
are supported. Data rates of 9,600 bps 
are supported when 32 or more serial 
ports are operating simultaneously. 

Desktop and floor-standing models 
come with terminal concentrators and 
RS-232 cabling. The Desktop Expand¬ 
er encompasses four eight-port con¬ 
trollers that can be stacked to save 
space. The Deskside Expander has 
eight controllers that each support 
up to eight connections with four 
full-high SCSI peripheral bays for 
one tape drive and three hard disk 
drives. 

Both models feature user-program¬ 
mable software and hardware control 
modes; Unix character-processing se¬ 
lection; and odd-, even-, no-, and 
forced-parity capabilities. 

The Desktop Expander costs 
$2,995, and the Deskside Expander is 
$5,595. 


Reader Service Number 35 


Revelation Technologies has an¬ 
nounced the Openlnsight Windows- 
based application-development tool 
with local and remote access capabili¬ 
ties. Users can link to such programs 
as SQL Server and Oracle to develop 
client-server applications. 

Openlnsight lets developers create 
multiuser applications, specify two- 
phase commit transaction processing, 
and ensure referential integrity. The 
tool incorporates an active data dic¬ 


tionary to provide an integrated view 
of such data sources as dBase, ANSI 
SQL, ASCII, and DB2. The dictionary 
also centralizes data-validation rules. 

The package requires 4 Mbytes of 
RAM, 10 Mbytes of hard disk space, a 
386SX processor, Microsoft Windows 
Version 3.0, and DOS Version 3.1. 

Openlnsight costs $895, with a five- 
user LAN pack available for $1,495. 

Reader Service Number 37 



Openlnsight lets users create data-entry forms as a component of multiuser da¬ 
tabase network applications. 
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Laptop networking 

ADI Systems has introduced the 
AQ-EISA three-quarter-sized net¬ 
work controller card for EISA sys¬ 
tems and three pocket LAN adapters 
for laptops and notebooks. 

The AQ-EISA features a 32-bit bus 
architecture designed with National 
Semiconductor’s ST-NIC Ethernet 
chip. The three-in-one controller also 
comes with a 32-Kbyte buffer memory 
and uses a shared-memory architec¬ 
ture. Other features include CSMA/ 
CD media-access control, baseband 
signaling, a 10-Mbps data-transfer 
rate, and diagnostics with a loop-back 
test. 

The pocket adapters include Ether¬ 
net models AQ-ECX-PKT and AQ- 
ETP-PKT, and Arcnet model AQ- 
ARC-PKT. Each model uses a 
standard 25-pin parallel port and has 
no configuration switches. Each 
adapter eliminates the need for jump¬ 
er and switch settings. Since the 
adapters do not require I/O slots, 
computers without available slots can 
attach to a network. 


The Interoffice electronic mail and 
office-integration system is now 
linked with EzBridge Peer-to-Peer 
VAX/VMS-to-AS/400 communica¬ 
tions software. This joint development 
between Boston Software Works and 
Systems Strategies transfers messages 
and synchronizes directories, allowing 
OfficeVision/400 and All-in-1 users to 
transparently send and receive elec¬ 
tronic mail. Text memos, word-pro¬ 
cessing documents, and binary files 
can be transferred over a synchronous 
APPC/LU6.2 transport by using exist¬ 
ing commands and functions. 


The TeamNet 3.0 concurrent engi¬ 
neering environment for distributed 
configuration management includes 
an X Window graphical interface. The 
package features additional process 
control and conflict-resolution mecha¬ 
nisms, a floating-license facility, and 
enhanced NFS client access. 

Interactive file-merging capabilities 
allow a visual side-by-side comparison 
of conflicting file changes. 

The Unix-based environment from 



The Aquila Ethernet and Arcnet 
Pocket Adapters preserve battery 
power and feature 220Y and 110V 
power-input support. 


(Controller) Reader Service 
Number 38 
(Adapters) Reader Service 
Number 39 


Users can also choose terminal em¬ 
ulation and print capabilities. The 
LU6.2 transport accommodates 
SDLC, token-ring, or X.25 connec¬ 
tions. 

EzBridge/InterOffice packages 
have a scheduled beta availability of 
March 1992. EzBridge Peer-to-Peer 
costs $10,000 to $35,000, depending on 
system size and functionality. Interof¬ 
fice ranges from $4,500 to $24,500 per 
office system platform, depending on 
CPU size. 

Reader Service Number 40 


TeamOne Systems transparently 
tracks product development by using 
tools that run on platforms in NFS 
networks. Users can access TeamNet 
from NFS clients with a single com¬ 
mand. Tools can be integrated into 
the environment without modifica¬ 
tion. 

Licenses that support an average of 
two concurrent users cost $3,000. 

Reader Service Number 41 


DOS transaction routing 

Digital Equipment Corp. has an¬ 
nounced the DEC RTR (Reliable 
Transaction Router) for MS-DOS 
software. 

The client-only software is support¬ 
ed by an RTR for MS-DOS VMS 
server that routes transactions away 
from failed processors transparent to 
the PC user. 

The company claims predictable 
server response to MS-DOS client re¬ 
quests via dynamic load balancing. 
The package also lets users develop¬ 
ment MS-DOS applications using 
3GL languages. 

The DOS RTR supports the Win¬ 
dows graphical interface and DECnet 
and TCP/IP network protocols. It 
provides distributed software fault- 
tolerant operation and remote site- 
disaster recovery, automatically 
replays and reroutes transactions in 
the event of failure, and lets the sys¬ 
tem administrator define a hierarchy 
for higher priority messages. Proto¬ 
typing and monitoring tools are in¬ 
cluded. 

The license is $275 for each sup¬ 
ported CPU. 

Reader Service Number 42 


Scheduling through e-mail 

Microsystems Software has released 
four WAN versions of its CaLANdar 
workgroup-scheduling software. The 
package uses standard e-mail packag¬ 
es to distribute activity and availabili¬ 
ty information. 

DOS users can distribute CaLAN¬ 
dar activity through in-place in¬ 
terserver gateways in a multiserver 
environment. The package allows 
Microsoft Mail, cc:Mail, and Banyan 
Vine Mail administrators to maintain 
CaLANdar user lists within their e- 
mail naming system. 

Once a new database is linked to an 
existing one, all other servers are noti¬ 
fied automatically of the new server’s 
existence and its people, resources, 
and resource availability. Updated 
availability and people/resource lists 
can be accessed by users on all servers 
within the network’s normal e-mail la¬ 
tency time. 

An override option lets users sup¬ 
press appointment availability and 
top-level descriptions. 

Reader Service Number 43 
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Linking to DEC and IBM networks 


McData Corp. has announced that 
the LinkMaster 7100 network control¬ 
ler now features local and remote 
SNA gateway connectivity for Ether¬ 
net, concurrent SNA, and non-SNA 
channel protocols on the same IBM 
host channel. The controller also sup¬ 
ports concurrent gateway and down¬ 
stream physical unit functionality for 
token-ring or Ethernet LANs. Multi¬ 
ple host protocol combinations in¬ 
clude X.25, Bsc, and SDLC. 

Software release Version 2.0 can act 
as a DEC Local Area Transport ter¬ 
minal server, providing coaxial and 
ASCII-attached users access to DEC 
host applications via the Ethernet 
LAN. LAT emulation consolidates 
multiple links and speeds data trans¬ 
fer across a 10-Mbps Ethernet line. 

Reader Service Number 44 



LAT emulation 
for local and 
remote control 
links users to 
DEC and IBM 
networks. 


Management and protocol-analysis tools 


Administrative efficiency 

Saber Software has announced two 
products based on the LAN Adminis¬ 
tration Architecture. The system lets 
the network administrator work in a 
central environment on one interface. 

Tools called agents collect network 
data, which the administrator views on 
a console as reports, charts, “browses,” 
or text listings. The LA A Agent Tool¬ 
kit bundles six tools that provide appli¬ 
cation metering for DOS and Win¬ 
dows, virus detection, automatically 
built hardware inventory, event moni¬ 
toring and logging, job scheduling, file 
auditing, and disk-space monitoring. 
The LAA Console/RT provides DDE 
and clipboard features. 

Reader Service Number 45 


Release 2.0 of the Comtest LA net¬ 
work management and protocol anal¬ 
ysis tools from M-Trade fixes bugs 
and lets users identify potential data 
bottlenecks. It also allows the injection 
of loads for orderly network expansion. 

A new interface for frame capture 
allows toggle switching between four 
display modes. 

In Browse mode, the top of the 
screen displays single lines that show 
the protocol type, the function of the 
packet, and the destination and source 
address. The bottom section shows 
full decode of the highlighted frame. 

In Hex Examin mode, the top of the 
screen shows full decode while the 


bottom shows Hex values for the 
highlighted frame. The Dialogue 
mode splits the screen in two, while 
Normal is a one-line mode. 

Other features include marker set¬ 
tings to show the range of frames to 
be saved to disk or printed, an en¬ 
hanced search facility, an optional 
time stamp, and a marker that indi¬ 
cates where frame capture was re¬ 
started. Comtest LA also provides a 
report generator, collision detection 
within the first 12 bytes, and a graph 
facility for protocol distribution. 

The package costs $6,999. 

Reader Service Number 46 


Alternative LAN 


The Nighthawk Electron¬ 
ics’ DXS-16 data-exchange 
device lets several comput¬ 
ers share such peripherals as 
printers, modems, and file 
servers. 

The customizable device 
features a 16-Mbyte buffer 
memory and 16 serial or 
parallel ports. Users can 
also choose 3270 coaxial or 
twin-axial ports. A 500,000- 
bps throughput rate is possi¬ 
ble on all ports during si¬ 
multaneous transmission 



Users can configure the DXS-16 data-exchange device by 
using four keys on the front panel and a 40-character LCD. 


and reception. When the 
memory is full, the input 
data rate of acceptance is 
automatically reduced to 
prevent “out-of-time” soft¬ 
ware errors. 

IBM desktop users can 
configure the device 
through their computers. 

A programming option 
supports conversions and 
manipulations on data 
throughput. 

Reader Service Number 47 
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1C Announcemer 


Company, Model, Function 


AMP Dual read-out single in-line memory module sockets provide termination resistance of 

SIMM sockets 20 m£l maximum initial, dielectric withstanding voltage of 1 k V AC, insulation resistance 

of 10,000 Mf2 minimum initial, and capacitance of 1 pF maximum. Available in 64- through 
100-position sizes with an extractor built into their housing. 

Innoventions Portable, stand-alone tester with 16-bit processor tests all standard SIMM and SIP memory 

Simcheck modules. Optional adapters extend testing capacity to other modules and individual DRAM 

Memory tester chips. Expansion slot for special accessories such as a device handler that automatically feeds 

modules for unattended operation. 


Linear Technology 
LTlllO.LTllll 
DC/DC converters 


Linear Technology 
LT1241 family 
Pulse-width modulators 


Motorola 

68340V 

Integrated processor 


NMB Technologies 
AAA4M200 family 
4-Mbit DRAMs 


OPTi 

DXBB 

PC/AT chipset 


Rohm 
MCH series 
Chip capacitors 


Rohm 

MNR series 

Chip resistor networks 


Teledyne Components 
TC44xx family 
Power drivers 


Texas Instruments 
SN75LBC086 
Interface device 


Texas Instruments 

TSL220 

Light converter 


LT1110 operates with a supply voltage as low as IV; LT1 111 works with supplies as low as 
2 V. Available in 8-pin DIP or SO-8 surface-mount packages. Adjustable versions and fixed 
5 V and 12 V configurations. Quiescent current for all configurations is 300 pA. Cost (in 100s): 
$3.15/$2.40 (LT1110/LT1111 DIP); $3.60/$2.80 (LT1110/LT1111 SO-8). 

PWMs are pin-compatible with industry-standard UC1842s but have lower quiescent cur¬ 
rent (8 mA typical). Current sense delay is 50 ns. Start-up current is less than 250 pA. Cross 
conduction current spikes in the totem pole output stage have been eliminated, enabling 
500-kHz operation. Available in 8-pin SO packages and plastic DIPs. Cost (in 1,000s): $3.74 

Low-voltage version of 68300 family dissipates just 140 mW from a 3.3 V power supply when 
clocked at 8.3 MHz. Integrates the functions of up to 20 discrete components including 68020- 
based CPU32 core, DMA, serial I/O, timers, and system integration module. Packaged in 
144-pin ceramic quad flat packs and 145-pin plastic PG As. Samples available. Cost: $30.60. 

CMOS technology DRAMs support direct zero-wait-state access with up to 25-MHz micro¬ 
processors. RAS access times of40,45,50,60, and 80 ns, with read/write cycle times as fast as 
80 ns. Available with optional battery backup mode. Offered in 1 -Mbit x 4 and 4-Mbit x 1 
versions. Cost (in 10,000s): from $20 (40-ns versions); $16 (80-ns versions). 

Modular chipset includes 82C496 block-interleave DRAM and AT bus controller, 82C206 
peripheral controller, and optional 82C497 write-back caching controller. Supports 486 
DRAM bursting and up to 64-Mbyte high-speed local memory using two-page interleaving 
for DRAMs. Cost (in 10,000s): $27.50 (82C496 and 82C206); $17.50 (82C497). 

Ceramic chips are available in 0805 and 1206 sizes, with capacitance range of 0.5 pF to 
2,700 pF for NPO dielectrics and 680 pF to 0.22 pFd for X7r dielectrics. Dielectrics are fabri¬ 
cated with pure palladium electrodes. Designed for both wave and IR reflow soldering and 
automatic placement in 8-mm tape-and-reel format. Cost (per 1,000): from $15.80 to $45.10. 

MNR design incorporates large solder pads for strong bond between component and land 
pattern, and large concave electrodes to promote self-alignment in reflow soldering and de¬ 
tachment of solder between nodes in flow soldering. MNR02 is a two-pin unit, 1 mm x 1 mm x 
0.35 mm. MNR12 and MNR14 are complementary 4- and 8-pin networks. Available on 8-mm 
plastic tape. Cost (per 1,000/in 100,000s): $15 (MNR02), $18 (MNR12), $36 (MNR14). 

TC4421 and TC4422 can drive 9A at high-speed switching and short propagation delays. 
TT4401/04/05/06/07 are five new power logic quad drivers with 25-ns rise times and peak cur¬ 
rent drive from 1.5 A to 6 A. Family also includes new power logic quad drivers. All devices 
available from stock in plastic, SOIC, and CerDIPs. Cost (1,000): $2.95. 

Single-channel driver/receiver is designed for medium attachment units in 10-MHz Ether¬ 
net applications. Functional components include differential receiver and driver, receiver 
squelch with noise filter, jabber controls, and collision detection. Compliant with IEEE 
STD 802.31, Type 10B ASE-T. Available in 24-pin, 300-mil DIP. Cost (in 1,000s): $8.40. 

Sensor comprises a large-area photodiode and a patented BiMOS current-to-frequency 
converter. It can be connected directly to a microprocessor or digital control circuit. Features 
118-dB dynamic range, variable (and single) supply range from 5 V to 10V, linearity typically 
within 2% of FSR, and sensitivity to change of 0.01 % of FSR. Available in 8-pin clear plastic 
DIP. Cost (in 1,000s): $4.61. 
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□ _ FREE exhibit passes 

□ More information on the DB/EXPO'92 Technical Conference 
(early bird and alumni discounts are now available). 
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DB/EXPO '92Jackies today's most 

pivotal topics, including: 

• DATABASES 

• INTEROPERABILITY 

• OPEN SYSTEMS 

• DOWNSIZING/RIGHTSIZING 

• CLIENT/SERVER 

• CASE TOOLS 
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The Exposition floor includes over 400 exhibits, 
feoturing the industry's hottest products. See what's 
new with trend-setters and industry 
giants like Apple, Borland, Computer 
Associates, DEC, Fox Software, H-P, 

IBI, IBM, Microsoft, Novell, 

Oracle, Teradata 
and Wordtech. 
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rent (8 mA typical). Current sense delay is 50 ns. Start-up current is less than 250 pA. Cross 
conduction current spikes in the totem pole output stage have been eliminated, enabling 
500-kHz operation. Available in 8-pin SO packages and plastic DIPs. Cost (in 1,000s): $3.74 

Low-voltage version of 68300 family dissipates just 140 mW from a 3.3 V power supply when 125 

clocked at 8.3 MHz. Integrates the functions of up to 20 discrete components including 68020- 
based CPU32 core, DMA, serial I/O, timers, and system integration module. Packaged in 
144-pin ceramic quad flat packs and 145-pin plastic PG As. Samples available. Cost: $30.60. 

CMOS technology DRAMs support direct zero-wait-state access with up to 25-MHz micro- 126 
processors. R AS access times of 40,45,50,60, and 80 ns, with read/write cycle times as fast as 
80 ns. Available with optional battery backup mode. Offered in 1-Mbit x 4 and 4-Mbit x 1 
versions. Cost (in 10,000s): from $20 (40-ns versions); $16 (80-ns versions). 

Modular chipset includes 82C496 block-interleave DRAM and AT bus controller, 82C206 127 

peripheral controller, and optional 82C497 write-back caching controller. Supports 486 
DRAM bursting and up to 64-Mbyte high-speed local memory using two-page interleaving 
for DRAMs. Cost (in 10,000s): $27.50 (82C496 and 82C206); $17.50 (82C497). 

Ceramic chips are available in 0805 and 1206 sizes, with capacitance range of 0.5 pF to 128 

2,700 pF for NPO dielectrics and 680 pF to 0.22 pFd for X7r dielectrics. Dielectrics are fabri¬ 
cated with pure palladium electrodes. Designed for both wave and IR reflow soldering and 
automatic placement in 8-mm tape-and-reel format. Cost (per 1,000): from $15.80 to $45.10. 

MNR design incorporates large solder pads for strong bond between component and land 129 

pattern, and large concave electrodes to promote self-alignment in reflow soldering and de¬ 
tachment of solder between nodes in flow soldering. MNR02 is a two-pin unit, 1 mm x 1 mm x 
0.35 mm. MNR12 and MNR14 are complementary 4- and 8-pin networks. Available on 8-mm 
plastic tape. Cost (per 1,000/in 100,000s): $15 (MNR02), $18 (MNR12), $36 (MNR14). 

TC4421 and TC4422 can drive 9A at high-speed switching and short propagation delays. 130 

TT4401/04/05/06/07 are five new power logic quad drivers with 25-ns rise times and peak cur¬ 
rent drive from 1.5 A to 6 A. Family also includes new power logic quad drivers. All devices 
available from stock in plastic, SOIC, and CerDIPs. Cost (1,000): $2.95. 

Single-channel driver/receiver is designed for medium attachment units in 10-MHz Ether- 131 

net applications. Functional components include differential receiver and driver, receiver 
squelch with noise filter, jabber controls, and collision detection. Compliant with IEEE 
STD 802.31, Type 10BASE-T. Available in 24-pin, 300-mil DIP. Cost (in 1,000s): $8.40. 

Sensor comprises a large-area photodiode and a patented BiMOS current-to-frequency 132 

converter. It can be connected directly to a microprocessor or digital control circuit. Features 
118-dB dynamic range, variable (and single) supply range from 5 V to 10V, linearity typically 
within 2% of FSR, and sensitivity to change of 0.01 % of FSR. Available in 8-pin clear plastic 
DIP. Cost (in 1,000s): $4.61. 
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The Exposition floor includes over 400 exhibits, 
featuring the industry's hottest products. See what's 
new with trend-setters and industry 
giants like Apple, Borland, Computer 
Associates, DEC, Fox Software, H-P, 

IBI, IBM, Microsoft, Novell, 

Oracle, Teradata 
and Wordtech. 
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DATABASE AND 
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Microsystem Announcements 


Company, Model, Function 


Acromag 

A VME944x-i series 
Digital I/O boards 


Analyx Systems 

ADDA-1218 

Data acquisition board 


ATI Technologies 
8514 Ultra, Graphics 
Ultra, Graphics Vantage 
Accelerator boards 


Cognex 

4100 

Machine vision system 


Diversified Technology 

ESP2000 

EISA SBC 

Keithley Metrabyte 
DAS-1600 

Data acquisition board 


VMEbus boards provide 250V AC (354V DC) isolation to protect VMEbus systems inter¬ 
facing to field I/O. Models available with 16 input and/or 16 output channels; each output 
channel has an optically isolated solid-state relay operating as a single-pole, single-throw 
Form A relay. Digital input models feature selectable input threshold voltages. Digital out¬ 
puts sink up to 1 amp DC, from up to a 55 V DC source. Cost: from $750. 

A166-kHz isolated Sbus subsystem with 16-channel, 12-bit A/D conversions and 4-channel, 
18-bit D/A conversions. Fits into a single Sbus slot in any Sparcstation. Converters operate 
automatically and simultaneously. Includes on-board 128-Kbyte dual-ported RAM. Sam¬ 
pling rate is programmable from 6 ps to 1.536 ms per sample. Cost: $2,495. 

Boards use ATI Mach 8 graphics coprocessor. The 8514 Ultra is for Micro Channel, AT, or 
EISA systems; includes coprocessor VRAM and VGA pass-through connector. Graphics 
Ultra adds Super-VG A graphics to 8514 Ultra for AT or EISA computers; includes VG A- 
Wonderchip set, built-in mouse port, and 400-dpi inport mouse. Graphics Vantage uses 
DRAM instead of VRAM. Cost: from $599 (8514), from $699 (Ultra), from $499 (Vantage). 

Single-board system plugs directly into a single VME backplane slot and runs Cognex’s 
software tools to solve alignment, guidance, inspection, and identification problems in in¬ 
dustrial environments. Uses 68EC030 microprocessor, two 512 x512 frame buffers, SCSI 
controller, VME interface, and optional 68882 floating-point coprocessor. Incorporates 
VC-1 chip for high-speed implementation of image analysis routines. Cost: under $10,000. 

Single-board computer is based on 80486 processor and passive backplane architecture. 
Runs at 33 MHz, has 8.33-MHz bus speed and 8-Kbyte DRAM, and incorporates Intel’s 
82350DT chip set. Cost (without RAM): $2,595. 

Successor to DAS-16 board offers 16-channel, 12-bit analog input with 100-kHz maximum 
conversion rate, plus 32 bits of digital I/O, on-board pacer clock, and two 16-bit counters. 
Software driver performs all board I/O and register functions. Cost: $899. 


Multiaccess Computing 
MCC-1000F 
Communication 
controller 


Frame relay adapter card communicates directly with a metropolitan or wide area network 
providing frame relay service—a high-speed data service based on ANSI T1.617 and T1.618. 
The card occupies one NuBus I/O slot on the Mac II system board and provides on-board 
channel service unit/data service unit capability. Cost: $2,995. 


Mux Lab EML5250 is a 68000-based PC-emulation board that supports up to seven concurrent 5250 

EML5250, EMLG ATE sessions for background applications. Compatible with PC/Support 36,38, and AS/400. 
Connector boards EMLG ATE supports host access for any user on a LAN, allows simultaneous viewing of ses¬ 

sions from two different hosts, and enables LAN users to access different gateway servers. 
Both products feature on-board RAM and are available in both XT/AT and PS/2 versions. 


SemiDisk Systems Full-length IS A-bus card stores files in a semiconductor memory that eliminates the seek 

Blue Flame III and rotational delay of mechanical disks. Communicates with DOS through a device driver 

Solid-state “disk” that is both DOS and Concurrent DOS compatible. Capacity is 56 Mbytes per card and 896 

Mbytes per drive. Data path is 16 bits and transfer rate is 4 Mbytes/sec. 


Tadpole Technology Single-board 6U VME intelligent graphics controller is an X Window R4 server for stand- 

TP-IGC6V alone or VMEbus-based use. Includes on-board RGB connection. Supports pixel data trans- 

Graphics controller fers over Ethernet or VME backplane. Dual-processor architecture features 40-MHz 

TMS34020 for screen updates and drawing operations and 20-MHz MC68020 processor for 
client-server communications, protocol handling, and user interface. Cost: from $3,295. 


Viewgraphics Single-board frame buffer uses solid-state RAM for real-time access, storage, and display 

Viewstore of high-resolution video sequences and super high resolution raster images. Holds up to 384 

Frame buffer Mbytes of 24-bit true-color image data on a single 9U x 400-mm VME board. Plugs directly 

into most high-end workstations and computer products. Programmable display manager 
provides independent control of display output for multiple windows. Supports more than 25 
video display resolutions up to 1,920 x 1,152, including RS-170, EBU, EGA, VGA, and pro¬ 
posed HDTV standards. Cost: from $25,000 (96 Mbytes) to $49,000 (384 Mbytes). 
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Membership/Subscription Application 



BENEFITS 



Receive Computer automat¬ 
ically with membership. Writ¬ 
ten and refereed by experts, 
it features articles on the latest 
developments in computer 
technology, applications, and 
research, as well as survey 
and tutorial articles on topics 
covering the entire computer 
field. 

Computer's popular depart¬ 
ments include New Products, 
Product Reviews, Book 
Reviews, Standards, Open 
Channel (a reader forum), and 
Computer Society News. 

Other membership benefits include the following: 

• Discounts of up to 50% on Computer Society Press 
books and videos. Over 700 titles covering a broad 
spectrum of computer science topics including net¬ 
working, communications, advanced systems, image 
processing, security, artificial intelligence, and 
visualization 

• Discounts on registration to over 100 conferences, 
workshops, and tutorials each year 

• Low subscription rates on special-interest periodicals 

• Opportunity to participate in any of our 32 technical 
committees—networks of professionals with common 
interests in computer hardware, software, and 
applications 

• Opportunity to participate in the development of more 
than 200 standards projects sponsored by the society 

• Low member rates on COMPMAIL, the electronic mail 
service especially designed for computer 


Schedule of Fees 


To join: see item 1, 2, or 3. 
To subscribe: see item 4. 


Membership dues and periodical subscriptions are annualized to, and expire on, 
December 31. Pay full- or half-year rate depending on date of receipt by the 


ir 1-Aug 31 Sept 1-Feb 2 


I don’t belong to the IEEE and I want 
to join just the Computer Society 


) I want to join both the Computer Society 
■ and the IEEE* 

I reside in Region 1-6 (United States). □ $58.50 

I reside in Region 7 (Canada). □ $53.50 

I reside in Region 8 (Europe, Africa, orthe Middle East) □ $53.00 

I reside in Region 9 (Latin America). □ $49.50 

I reside in Region 10 (Asia and Pacific). □ $48.50 

*ACM members who join both IEEE and the Computer Society may deduct $5 off th 
full-year rate; $2.50 off the half-year rate. 


□ 

$58.50 

□ 

$117 

□ 

$53.50 

□ 

$107 

□ 

$53.00 

□ 

$106 

□ 

$49.50 

□ 

$ 99 

□ 

$48.50 

□ 

$ 97 


3 1 already belong to the IEEE and I want 
to join the Computer Society 


□ $11.00 


IEEE Member Number 


[ SPECIAL-INTEREST PERIODICALS for new or current members 


IEEE Computer Graphics and Applications . 

.6 

□ $12.50 

□ $ 25 

IEEE Design and Test . 

.4 

□ $11.00 

□ $ 22 

IEEE Expert . 

.6 

□ $10.00 

□ $ 20 

IEEE Micro . 

.6 

□ $11.50 

□ $ 23 

IEEE Software . 

.6 

□ $12.50 

□ $ 25 

Transactions on: 




Computers . 

....12 

□ $12.00 

□ $ 24 

Knowledge and Data Engineering 

.6 

□ $ 9.50 

□ $ 19 

Parallel and Distributed Systems . 

.6 

□ $ 9.50 

□ $ 19 

Pattern Analysis and Machine Intelligence 

...12 

□ $12.00 

□ $ 24 

Software Engineering . 

....12 

□ $11.00 

□ $ 22 

IEEE Annals of the History of Computing . 

.4 

□ $ 8.00 

□ $ 16 


□ Visa □ MasterCard □ American Express 


II I I I I I I I I I I I I I I I 


Charge Card Number (Minimum Charge $10) 

try of origin accepted fr 

bourg, Netherlands, Norway, P 
en, Switzerland, UK, USA, a 


Periodicals total $_ 

CA, DC residents add 
applicable sales tax $_ 

Membership fees $_ 

Total $_ 


CANADIAN residents 
add 7% GST; BELGIAN 
residents add 6°/o VAT $_ 

Total enclosed $_ 


I hereby make application for Computer Society membership and, if elected, will be governed by IEEE’s and the society's constitutions, bylaws, and statements of 
policies and procedures. 

MAILING ADDRESS _ 


Date of birth 



EDUCATION (highest level < 


OCCUPATION _ _ 

Title or Position Endorser's signature 



City State/Country Zip City State/Country Zip 


Mail or fax to appropriate Computer Society office: 

EUROPE: 13, Avenue de I’Aquilon, B-1200, Brussels, BELGIUM. Fax: 32-2-770-8505 

PACIFIC RIM: Ooshima Building, 2-19-1 Minami-Aoyama, Minato-ku, Tokyo 107, JAPAN. Fax: 81-3-3408-3553 

ALL OTHERS: 10662 Los Vaqueros Circle, PO Box 3014, Los Alamitos, CA 90720-1264 USA. Fax: 714-821-4010 




















































CALL FOR PAPERS 


Int’I Symp. on Design and Review of Soft¬ 
ware-Controlled Safety-Related Systems: 

Sept. 21-23,1992, Waterloo, Ont., Canada. 
Sponsor: Univ. of Waterloo. Submit paper 
to Diana Del Bel Belluz, Inst, for Risk Re¬ 
search, Univ. of Waterloo, Waterloo, Ont., 
N2L 3G1, Canada, phone (519) 885-1211, 
ext. 5136, fax (519) 888-6197. 

J. of Information Ethics , a new semiannu¬ 
al periodical, seeks submittals on ethics in 
the areas of information or knowledge pro¬ 
duction and dissemination. Publisher: Mc¬ 
Farland and Co. Before submitting a man¬ 
uscript, contact Robert Hauptman, LRS, 

St. Cloud State Univ., St. Cloud, MN 
56301, phone (612) 255-4822. 

Sixth Workshop on New Directions for 
Testing: May 21-24,1992, Montreal. Sub¬ 
mit 10 copies of 300- to 500-word abstract 
by Feb. 24,1992, to B. Kaminska, Ecole 
Polytechnique, PO Box 6079, Station A, 
Montreal, Que., H3C 3A7, Canada, phone 
(514) 340-4270, fax (514) 340-4147, e-mail 
bozena@vlsi.polymtl.ca. 

VLDB 92, 18th Int’I Conf. on Very 
Large Data Bases: Aug. 23-27,1992, 
Vancouver, B.C., Canada. Sponsors: 

VLDB Endowment, Canadian Information 
Processing Soc. Submit six copies of full 
paper (maximum length 5,000 words) by 
Feb. 28,1992, to Alberto Mendelzon, 
Computer Science Dept., Univ. of Toron¬ 
to, 10 King’s College Rd., Toronto, Ont. 
M5S 1A4 Canada, e-mail mendel@db. 
toronto.edu (in the Americas); Erich J. 
Neuhold, IPSI, Dolivostrasse 15, Box 
104326, D-6100 Darmstadt, Germany, 
e-mail neuhold@darmstadt.gmd.de (in Eu¬ 
rope); or Yahiko Kambayashi, IMEL, Fac¬ 
ulty of Eng., Kyoto Univ., Sakyo, Kyoto, 
606-01, Japan, e-mail yahiko@kuis.kyoto- 
u.ac.jp (in the Far East). 

IC 3 N, First Int’I Conf. on Computer 
Comm, and Networks: June 8-10,1992, San 
Diego, Calif. Sponsor: Int’I Soc. for Mini 
and Microcomputers. Submit four copies 
of full paper by Feb. 28,1992, and camera- 
ready manuscript by Apr. 27,1992, to Ki- 
joon Chae, Computer Science Dept., US 
Naval Academy, Annapolis, MD 21402, 
phone (410) 267-3080, fax (410) 267-4883, 
e-mail chae@usna.navy.mil. 

Dexa 92, Third Int’I Conf. on Database 
and Expert Systems Applications: Sept. 2- 
4, 1992, Valencia, Spain. Cosponsors: 
Technical Univ. of Valencia et al. Submit 
four copies of paper by Feb. 28,1992, to 
Isidro Ramos, Univ. Politecnica Valencia, 
Sistemas Informaticos y Computacion 
Dept., Apartado 22012, E-46020 Valencia, 
Spain, phone 34 (6) 3877-350, fax 34 (6) 
3877-359, e-mail iramos@dsic.upv.es. 


Int’I J. of Expert Systems: Research and 
Applications plans a special issue on veri¬ 
fication and validation of expert systems, 
especially with regard to applications of 
formal V&V methods. Submit five copies 
of article by Feb. 29,1992, to Alun D. 
Preece, Computer Science Dept., Concor¬ 
dia Univ., 1455 de Maisonneuve West, 
Montreal, Que. H3G 1M8, Canada, phone 
(514) 848-3014, fax (514) 848-2830, e-mail 
preece@concour.cs.concordia.ca. 


OOPSLA 92, Seventh Conf. on Object- 
Oriented Programming Systems, Languag¬ 
es, and Applications: Oct. 18-22,1992, 
Vancouver, B.C., Canada. Sponsor: ACM. 
Submit six copies of full paper by Mar. 1, 
1992, and camera-ready paper by July 20, 
1992, to Rebecca Wirfs-Brock, 16200 SW 
Pacific Hwy., Suite 187, Tigard, OR 97224. 

35th Midwest Symp. on Circuits and Sys¬ 
tems: Aug. 9-12,1992, Washington, DC. 


Call for articles and referees for Computer 

Computer seeks articles for inclusion in future special issues. 

Computer Support for Concurrent Engineering has been selected as the 
theme for the January 1993 issue. Manuscripts reporting survey, original re¬ 
search, design and development, and applications of computer support for con¬ 
current engineering are sought immediately. See p. 101 of the December 1991 
issue for more details. 

Fourteen copies of the complete manuscript are due by Mar. 15,1992; notifi¬ 
cation of decisions is set for July 15,1992; and the deadline for submittal of the 
final version of each manuscript is Sept. 1, 1992. 

Submissions and questions should be directed to K. Srinivas, Drawer 2000, 
Concurrent Engineering Research Center, West Virginia University, 955 Hartman 
Run Rd., Morgantown, WV 26505, phone (304) 293-7226, fax (304) 293-7541, 
e-mail splissue@cerc.wvnet.wvu.edu. 

Multichip modules has been selected as the theme for the April 1993 issue. 
Manuscripts reporting survey, original research, design and development, and 
applications of MCM technology are sought immediately. See p. 101 of the De¬ 
cember 1991 issue for more details. 

A 300-word abstract of the manuscript must be submitted by Apr. 15,1992; 14 
copies of the full manuscript are due by June 15,1992; notification of decisions 
is set for Oct. 15,1992; and the deadline for submittal of the final version of each 
manuscript is Dec. 1, 1992. 

Submissions and questions should be directed to P.R. Mukund, Dept, of Elec¬ 
trical Eng., Rochester Inst, of Tech., 1 Lomb Memorial Dr., Rochester, NY 14623, 
phone (716) 475-2174, e-mail prmee@ritvax.isc.rit.edu; or J.F. McDonald, Dept, 
of Electrical Eng., Rensselaer Polytechnic Inst., Troy, NY 12180, phone (518) 
276-2919, e-mail mcdonald@unix.cie.rip.edu. 


For submittal to Computer, manuscripts must not have been previously 
published or currently submitted for publication elsewhere. Each manu¬ 
script should be no more than 32 typewritten, double-spaced, single-sided 
pages long, including all text, figures, and references. Each of the 14 cop¬ 
ies submitted should include a title page that contains the title of the arti¬ 
cle, the full name(s) and affiliation(s) of the author(s), complete postal and 
electronic address(es) of all the authors as well as their telephone and fax 
number(s), a 300-word abstract, and a list of keywords identifying the cen¬ 
tral issues of the manuscript’s contents. The final manuscript should be ap¬ 
proximately 8,000 words in length and contain no more than 12 references. 

If you are willing to review articles for any special issue, please send a 
note listing your research interests to Jon T. Butler, editor-in-chief of Com¬ 
puter, at the Dept, of Electrical and Computer Eng., Naval Postgraduate 
School, Code EC/Bu, Monterey, CA 93943-5004, Internet butler@ece.nps. 
navy.mil. 
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Cosponsors: George Washington Univ. et 
al. Submit five copies of 500-word summa¬ 
ry by Mar. 1,1992, to Mona E. Zaghloul, 
Electrical Eng. and Computer Science 
Dept., George Washington Univ., Wash¬ 
ington, DC 20052, phone (202) 994-3772, 
fax (202) 994-0227, e-mail zaghloul@seas. 
gwu.edu. 

Third Int’l Workshop on VLSI for Artifi¬ 
cial Intelligence and Neural Networks: 

Sept. 2-4,1992, Oxford, England. Sponsor: 
Univ. of Oxford. Submit 500-word abstract 
by Mar. 1, 1992, to Jos6 G. Delgado-Frias, 
Electrical Eng. Dept., State Univ. of New 
York at Binghamton, Binghamton, NY 
13902-6000, phone (607) 777-4806, fax 
(607) 777-4822, e-mail delgado@bingvaxu. 
cc.binghamton.edu. 

ICMC 92, 1992 Int’l Computer Music 
Conf.: Oct. 14-18,1992, San Jose, Calif. 
Submit paper by Mar. 1,1992, to ICMC 92, 
Drawer 104, Music Dept., 1 Washington 
Sq., San Jose State Univ., San Jose, CA 
95192-0095, phone (408) 924-4675, fax 
(408) 924-4773, e-mail icms@sjsuvml. 
bitnet.edu. 

Concur 92, Third Int’l Conf. on Concurren¬ 
cy Theory: Aug. 24-27,1992, Stony Brook, 
N.Y. Submit paper by Mar. 1, 1992, to 
Ranee Cleaveland, Computer Science 
Dept., North Carolina State Univ., Ra¬ 
leigh, NC 27695-8206, phone (919) 515- 
7862, fax (919) 515-7382, e-mail rance@ 


DIAC 92 Workshop: May 3,1992, Berke¬ 
ley, Calif. Sponsor: Computer Profession¬ 
als for Social Responsibility. Submit four 
copies of proposal by Mar. 1,1992, to Dou¬ 
glas Schuler, 2202 N. 41st St., Seattle, WA 
98103, home phone (206) 632-1659, office 
phone (206) 865-3832, e-mail dschuler@ 
june.cs.washington.edu. 


Symp. on New Trends in Neural Networks: 

May 22-29,1992, Ghent, Belgium. Submit 
extended abstract by Mar. 1,1992, to F. 
Vandamme or Carine van Belleghem, 
Blandijnberg 2, 9000 Ghent, Belgium, 
phone 32 (91) 643-952, fax 32 (91) 644-197. 


Frontiers 92, Fourth Symp. on the 
Frontiers of Massively Parallel Com¬ 
putation: Oct. 19-21. 1992. McLean, Va. 
Cosponsors: NASA Goddard Space Flight 
Center et al. Submit six copies of paper 
summary by Mar. 2,1992, and camera- 
ready paper by July 1,1992, to H.J. Siegel, 
Frontiers 92, School of Electrical Eng., 
1285 Electrical Eng. Bldg., Purdue Univ., 
West Lafayette, IN 47907-1285, Internet 
front92@ecn.purdue.edu. 


13th Int'l Symp. on Electronics Manufac¬ 
turing Tech.: Sept. 28-30,1992, Baltimore. 
Cosponsors: IEEE Components, Hybrids, 
and Manufacturing Tech. Soc., Electronic 
Industries Assoc. Submit paper and 250- 
word abstract by Mar. 2,1992. and final 
manuscript by June 1,1992, to Michael P. 
Cassidy, AT&T, 3000 Skyline Dr., Mes¬ 
quite, TX 75149, phone (214) 284-2100, fax 
(214) 284-8182. 


10th Pacific Northwest Software Quality 
Conf.: Oct. 19-21,1992, Portland, Ore. 
Submit abstract by Mar. 2,1992, to Hilly 
Alexander, ADP Dealer Services Group, 
2525 SW First Ave., Portland, OR 97201- 
4760, phone (503) 294-4200. 

Hot Chips IV, Symp. on High-Per¬ 
formance Chips: Aug. 9-11, 1992, 
Stanford, Calif. Sponsor: IEEE Computer 
Soc. Technical Committee on Microproces¬ 
sors and Microcomputers. Submit one- to 
five-page extended abstract by Mar. 6, 
1992, to Bob Miller, Univ. of California, 
Computer Science Div., 533 Evans Hall, 
Berkeley, CA 94720, phone (510) 642- 
6037, fax (510) 642-5775, e-mail bmiller@ 
ginger.berkeley.edu. 

Second European Modula-2 Conf.: Sept. 8- 
9, 1992, Leicester, England. Sponsor: Lei¬ 
cester Polytechnic. Submit three copies of 
extended abstract by Mar. 9,1992, to Sue 
Brooks, Leicester Polytechnic, Marketing 
Centre, PO Box 143, Leicester LEI 9BH, 
England, phone 44 (0533) 577-577, fax 44 
(0533) 549-972. 


1992, to Yelena Yesha, Computer Science 
Dept., Univ. of Maryland at Baltimore 
County, Baltimore, MD 21228, phone 
(301) 455-3542, fax (301) 455-3969, e-mail 

yeyesha@algol.umbc.edu. 

10th Int’l Zurich Symp. on Electromagnet¬ 
ic Compatibility: Mar. 9-11, 1993, Zurich, 
Switzerland. Cosponsors: IEEE Switzer¬ 
land Chapter on Electromagnetic Compat¬ 
ibility et al. Submit 12 copies of abstract 
and five-page (maximum) summary by 
Mar. 15,1992, and manuscript by Oct. 31, 
1992, to Technical Program Committee 
EMC Zurich 93, ETH-Zentrum-IKT, CH- 
8092 Zurich, Switzerland. 

Gomac 92, Government Microcircuit Ap¬ 
plications Conf.: Nov. 8-12,1992, Las Ve¬ 
gas, Nev. Cosponsors: US Dept, of De¬ 
fense et al. Submit 35-word abstract and 
one copy of two-page (maximum) summa¬ 
ry by Mar. 16,1992, and camera-ready 
version by Aug. 14,1992, to Palisades Inst, 
for Research Services, Attn.: Jay Morreale 
(G-92), 201 Varick St., Suite 1140, New 
York, NY 10014. 


® RSP, Third Int’l Workshop on Rapid 
System Prototyping: June 23-25, 

1992, Research Triangle Park, N.C. Co¬ 
sponsors: IEEE Computer Soc. Technical 
Committees on Design Automation, Simu¬ 
lation, and Test Tech. Submit five copies of 
extended summary or full paper (pre¬ 
ferred) by Mar. 10,1992, and final manu¬ 
script by June 23,1992, to Nick Kanopou- 
los. Center for Systems Eng., Research 
Triangle Inst., 3040 Cornwallis Rd., Re¬ 
search Triangle Park, NC 27709, phone 
(919) 541-7341, fax (919) 541-6515, e-mail 
rsp@rti.rti.org. 

® llth Symp. on Reliable Distributed 

Systems: Oct. 5-7,1992, Houston. 
Cosponsors: IEEE Computer Soc. Techni¬ 
cal Committee on Distributed Processing 
et al. Submit five copies of manuscript by 
Mar. 15,1992, and camera-ready paper by 
July 10,1992, to Kishor S. Trivedi, Electri¬ 
cal Eng. Dept., Duke Univ., Durham, NC 
27706, phone (919) 660-5269, e-mail kst@ 
egr.duke.edu. 


Computer-Based Design Environments 
Symp.-Workshop: Aug. 18-19,1992, Ba¬ 
den-Baden, Germany. Submit abstract by 
Mar. 15,1992, and full paper by May 12, 
1992, to Jens Pohl, CAD Research Unit, 
Cal Poly, San Luis Obispo, CA 93407, fax 
(805) 756-5986. 

1992 Classification Soc. of North Am. An¬ 
nual Meeting: June 11-13,1992, East Lan¬ 
sing, Mich. Submit 200-word abstract by 
Mar. 15, 1992, to Wayne DeSarbo, Market¬ 
ing and Statistics Depts., School of Busi¬ 
ness, Univ. of Michigan, Ann Arbor, MI 
48109-1234, phone (313) 936-2084, e-mail 
wayne_s._desarbo@um.cc.umich.edu. Bit- 
net gegb@umichum. 

J. of Computer and Software Eng. plans a 
special issue in the fall of 1992 on distribut¬ 
ed and parallel computing systems. Submit 
five copies of full manuscript by Mar. 15, 


^ ASPLOS 5, Fifth Conf. on Architec- 
nAz tural Support for Programming Lan¬ 
guages and Operating Systems: Oct. 12-15, 
1992, Boston. Sponsor: ACM. Submit five 
copies of paper by Mar. 17,1992, and fi¬ 
nal version by July 13,1992, to Hank Levy, 
Computer Science and Eng. Dept., Univ. 
of Washington, Seattle, WA 98195, e-mail 
levy@cs.washington.edu. 

JICSLP 92, Joint Int’l Conf. and Symp. on 
Logic Programming: Nov. 9-13, 1992, 
Washington, DC. Cosponsors: Assoc, for 
Logic Programming, Univ. of Maryland 
Inst, for Advanced Computer Studies. Sub¬ 
mit six copies of 200-word abstract and 
5,000-word (maximum length) manuscript 
by Mar. 20,1992, to Krzysztof R. Apt, 
CWI, Kruislaan 413,1098 SJ Amsterdam, 
The Netherlands. 


Fifth ISMM Int’l Conf. on Parallel and 
Distributed Computing and Systems: Oct. 
1-3,1992, Pittsburgh. Sponsors: Int’l Soc. 
for Mini and Microcomputers, Pittsburgh 
Supercomputing Center. Submit four cop¬ 
ies of manuscript by Mar. 25,1992, and 
camera-ready version by July 31, 1992. to 
Rami Melhem, Computer Science Dept., 
Univ. of Pittsburgh, Pittsburgh, PA 15260, 
phone (412) 624-8426, e-mail melhem@ 
cs.pitt.edu. 

ATS 92, First Asian Test Symp.: 

Nov. 26-27, 1992, Hiroshima, Japan. 
Submit five copies of 50-word abstract by 
Mar. 28,1992, and camera-ready copy by 
July 25, 1992, to Hideo Fujiwara, Comput¬ 
er Science Dept., Meiji Univ., 1-1-1 Hi- 
gashimita, Tama-ku, Kawasaki 214, Japan, 
fax 81 (44) 934-7912, e-mail fujiwara@cs. 
meiji.ac.jp. 

Second Golden West Int'l Conf. on Intelli¬ 
gent Systems: June 1-3, 1992, Reno, Nev. 
Sponsor: Univ. of Nevada. Submit two- to 
three-page abstract by Mar. 30, 1992, to 
Carl G. Looney, Computer Science Dept., 


February 1992 
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Univ. of Nevada, Reno, NV 89577, phone 
(702) 784-4313, fax (702) 784-1766, e-mail 
looney@mammoth.cs.unr.edu. 

ISSRE 92, Third Int’l Symp. on Soft- 

ware Reliability Eng.: Oct. 7-9,1992, 
Research Triangle Park, N.C. Cosponsors: 
IEEE Computer Soc. Technical Commit¬ 
tee on Software Eng. et al. Submit five 
copies of full paper by Mar. 31,1992, and 
camera-ready version by Aug. 1,1992, to 
John C. Munson, Computer Science Div., 
Univ. of West Florida, Pensacola, FL 
32514, phone (904) 474-2989, e-mail 
jmunson@dcsll9.dcsnod.uwf.edu; or Taghi 
M. Khoshgoftaar, Computer Science and 
Eng. Dept., Florida Atlantic Univ., Boca 
Raton, FL 33431, phone (407) 367-3994, 
e-mail taghi@cse.fau.edu. 

Visualization 92: Oct. 19-23,1992, 
NSZ Boston. Sponsor: IEEE Computer 
Soc. Technical Committee on Computer 
Graphics. Submit four copies of 500-word 
paper by Mar. 31,1992, to Gregory M. 
Nielson, Arizona State Univ., Rural Rd., 
and University Ave., Tempe, AZ 85287- 
5406, phone (602) 965-2785, e-mail 
nielson@enuxva.eas.asu.edu. 

Supercomputing 92: Nov. 16-20, 

1992, Minneapolis, Minn. Submit 
manuscript by Apr. 1,1992, to Ann Hayes, 
Los Alamos Nat’l Lab, Bikini Rd., MS 
B287, Los Alamos, NM 87545, phone (505) 
665-4506, fax (505) 665-4361, e-mail ahh@ 
lanl.gov. 

^jjj) IEEE Design & Test of Computers 
^57 seeks papers (20 double-spaced pag¬ 
es, including figures) on the design and test 
of megabit memories for its March 1993 
special issue. Submit a 300-word abstract, 
keywords, and your mail and e-mail ad¬ 
dresses along with six copies of the paper 
before Apr. 1,1992, to Manuel d’Abreu, 
GE R&D, 1 River Rd., PO Box 8, KW- 
C308, Schenectady, NY 12301, phone (518) 
387-7097, fax (518) 387-5299, e-mail 
dabreu@crd.ge.com. 

CSM 92,1992 Conf. on Software 

Maintenance: Nov. 9-12, 1992. Orlan¬ 
do, Fla. Submit five copies of 250-word ab¬ 
stract and 1,000- to 5,000-word paper by 
Apr. 1,1992, and the final version by Sept. 
1, 1992, to Marc Kellner, Software Eng. 
Inst., Carnegie Mellon Univ., Pittsburgh, 
PA 15213-3890, phone (412) 268-7721, fax 
(412) 268-5758, e-mail mik@sei.cmu.edu. 


SIGDoc 92, 10th SIGDoc Int’l Conf.: Oct. 
13-16,1992, Ottawa, Canada. Sponsors: 
ACM Special Interest Group on Docu¬ 
mentation, Northern Telecom and Bell- 
Northern Research. Submit proposal by 
Apr. 3, 1992, to Assoc, for Computing Ma¬ 
chinery, 11 W. 42nd St., New York, NY 
10036, phone (212) 869-7440; or BNR, 
(613) 763-2134, fax (613) 763-2626, e-mail 
sigdoc92@bnr.ca. 


(Qjj) TAI 92, Fourth IEEE Conf. on Tools 
for Artificial Intelligence: Nov. 10- 
13,1992, Arlington, Va., Submit five cop¬ 
ies of full paper by Apr. 15,1992, to H.E. 


Stephanou, New York State Center for 
Advanced Tech, in Automation and Ro¬ 
botics, Rensselaer Polytechnic Inst., Troy, 
NY 12180-3590, phone (518) 276-6156 or 
(518) 276-6965, fax (518) 276-4897, e-mail 
hes@cat.rpi.edu. 

ICSC 92, Second Int’l Computer Science 
Conf.: Dec. 13-16,1992, Hong Kong. Co¬ 
sponsor: IEEE Hong Kong Section et al. 
Submit five copies of 5,000-word (maxi¬ 
mum length) manuscript by Apr. 15,1992. 
and camera-ready version by Sept. 15, 

1992, to Frederick H. Lochovsky, Comput¬ 
er Science Dept., Hong Kong Univ. of Sci¬ 
ence and Tech., Clear Water Bay, Kow¬ 
loon, Hong Kong, fax (852) 358-1477, 
e-mail fred@uxmail.ust.hk; or Dennis C. 
Tsichritzis, Centre Universitaire d’lnform- 
atique, University de Genhve, 12, rue du 
Lac, CH 1207 Geneve (Eaux-Vives), Swit¬ 
zerland, fax 41 (22) 735-3905, e-mail dt@ 
cui.unige.ch. 

17th Conf. on Local Computer Net- 
'n&Z works: Oct. 11-14,1992, Minneapolis, 
Minn. Sponsor: IEEE Computer Soc. 
Technical Committee on Computer Comm. 
Submit five copies of 250-word abstract 
and full paper by Apr. 20,1992, and cam¬ 
era-ready version by Aug. 3,1992, to Steve 
Bell, Hughes LAN Systems, MS 392, 1072 
S. Saratoga Svale Rd., San Jose, CA 95129, 
phone (415) 966-7926, fax (415) 966-7990, 
e-mail sbell@hls.com. 

ACM SIGSoft 92, Fifth Symp. on Software 
Development Environments: Dec. 9-11, 
1992, Washington, DC. Submit six copies 
of abstract and full paper (maximum 
length 6,000 words) by Apr. 21,1992, and 
camera-ready paper by Sept. 1,1992, to 
Herbert Weber, Fachbereich Informatik, 
Univ. of Dortmund, Baroperstrasse 301, 
4600 Dortmund 50, Germany. 

® ICARCV 92, Second Int’l Conf. on 
Automation, Robotics, and Comput¬ 
er Vision: Sept. 15-18,1992, Singapore. 
Cosponsors: Singapore Institution of Engi¬ 
neers et al. Submit four copies of 300- to 
500-word summary on neural networks in 
engineering and scientific applications by 
Apr. 30,1992, and final manuscript by 
June 30,1992, to Yoshiyasu Takefuji, Elec¬ 
trical Eng. Dept., Case Western Reserve 
Univ., Cleveland, OH 44106, phone (216) 
368-6430, fax (216) 368-2668, e-mail 
takefuji@axon.eeap.cwru.edu; and submit 
four copies on other subjects to ICARCV 
Secretariat, Associated Conventions and 
Exhibitions, 204 Bukit Timah Rd., #04-00, 
Boon Liew Bldg., Singapore 0922, phone 
(65) 799-5470, fax (65) 791-2687, e-mail 
emital@ntuvax.bitnet. 

ISCIS 7, Seventh Int’l Symp. on 
Computer and Information Sciences: 

Nov. 2-4,1992, Antalya, Turkey. Cospon¬ 
sors: IEEE Computer Soc. Turkey Chapter 
et al. Submit three copies of 10-page paper 
or four-page short communication by May 
1,1992, to Erol Gelenbe, Ecole des Hautes 
Etudes en Informatique, 45 rue des Saints- 
Peres, 75006 Paris, France, fax 33 (1) 4286- 
2231, e-mail erol@ehei.ehei.fr. 




February 1992 

ICICI 92, Int’l Conf. on Intelligent Control 
and Instrumentation, Feb. 18-21, Singa¬ 
pore. Cosponsors: IEEE Singapore Section 
et al. Contact R. Devanathan, IEEE Singa¬ 
pore Section, 200 Jalan Sultan, #11-03 Tex¬ 
tile Centre, Singapore 0719, phone (65) 
291-9690, fax (65) 292-8596, e-mail 
bitnet% “edevan@ntivax”. 

ISSCC 92,1992 IEEE Int’l Solid-State Cir¬ 
cuits Conf., Feb. 19-21. San Francisco. 
Sponsors: IEEE Solid-State Circuits Coun¬ 
cil et al. Contact Diane Suiters, Courtesy 
Associates, 655 15th St. NW, Suite 300, 
Washington, DC 20005, phone (202) 639- 
4255. 

Medical Imaging VI, Feb. 23-27, Newport 
Beach, Calif. Sponsor: Int’l Soc. for Opti¬ 
cal Eng. Contact SPIE, PO Box 10, Bell¬ 
ingham, WA 98227-0010, phone (206) 676- 
3290, fax (206) 647-1445. 


First Japan Information Access Project, 
Feb. 24, Washington, DC. Contact JIAP, 
1730 Massachusetts Ave. NW, Washington, 
DC 20036-1903, phone (202) 833-4545, fax 
(202) 728-9614. 


yK} Compcon Spring 92, Feb. 24-28. San 

Francisco. Contact Compcon Spring 
92, IEEE Computer Soc., 1730 Massachu¬ 
setts Ave. NW, Washington, DC 20036- 
1903, phone (202) 371-1013, fax (202) 728- 


CAAP 92, Colloquium on Trees in Alge¬ 
bra and Programming, and ESOP 92, Eu¬ 
ropean Symp. on Programming, Feb. 24- 

28, Rennes, France. Contact J.-C. Raoult, 
IRISA, Campus de Beaulieu, F-35042 
Rennes Cedex, France, e-mail raoult@irisa. 
fr (for CAAP 92); or B. Krieg-Brueckner, 
FB3 Mathematik und Informatik, Univer- 
sitaet Bremen, Postfach 330 440, D-2800 
Bremen 33, Germany, phone 49 (421) 218- 
3660, fax 49 (421) 218-3054, e-mail bkb@ 
informatik.uni-bremen.de (for ESOP 92). 

11th Int’l Workshop on Distributed Artifi¬ 
cial Intelligence, Feb. 26-28, Glen Arbor, 
Mich. Cosponsors: Univ. of Michigan, In¬ 
dustrial Tech. Inst. Contact Edmund H. 
Durfee, Electrical Eng. and Computer Sci¬ 
ence Dept., Univ. of Michigan, Ann Arbor, 
MI 48109, phone (313) 936-1563, fax (313) 
763-1260, e-mail durfee@caen.engin.umich. 
edu. 

Computer Law Assoc. Conf., Feb. 27-28, 

San Francisco. Contact Barbara Fieser, 
CLA, 8303 Arlington Blvd., Suite 210, 
Fairfax, VA 22031, phone (703) 560-7747, 
fax (703) 207-7028. 
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GLS-VLSI 92, Second Great Lakes 
Symp. on VLSI, Feb. 28-29, Kalam¬ 
azoo, Mich. Sponsor: Western Michigan 
Univ. Contact Naveed Sherwani, Comput¬ 
er Science Dept., Western Michigan Univ., 
Kalamazoo, MI 49008, phone (616) 387- 
5662, fax (616) 387-3999. 


March 1992 


1992 ACM/SIGAPP Symp. on Applied 
Computing, Mar. 1-3, Kansas City, Mo. 
Contact Hal Berghel, CAIES, 232 SCEN 
Computer Science, Univ. of Arkansas, 
phone (501) 575-7343, e-mail hlb@uafhp. 
uark.edu. 


{jgjN CAIA 92, Eighth IEEE Conf. on Ar- 
tificial Intelligence Applications, 
Mar. 2-6, Monterey, Calif. Contact CAIA 
92, IEEE Computer Soc., 1730 Massachu¬ 
setts Ave. NW, Washington, DC 20036- 
1903, phone (202) 371-1013, fax (202) 728- 


SIGForth 92 Workshop and Seminars, 

Mar. 4-7, Kansas City, Mo. Sponsor: ACM. 
Contact George Shaw, Shaw Labs, PO Box 
3471, Hayward, CA 94540-3471, phone 
(510) 276-5953, fax (510) 276-6050, e-mail 
george_shaw@mts.cc.wayne.edu, Compu¬ 
Serve 70413,2005, GEnie: g.shawl. 

Seventh Computers in Libraries Conf., 

Mar. 4-7, Washington, DC. Contact Mary 
Ellen Cisero, 11 Ferry Ln. W, Westport, 

CT 06880, phone (800) 635-5537 or (203) 
226-6967, fax (203) 454-5840. 

ISS 92, Second Irvine Software Symp., 

Mar. 6, Irvine, Calif. Sponsor: Irvine Re¬ 
search Unit in Software. Contact Robin 
Sharp or Mojgan Bonakdar, IRUS, Infor¬ 
mation and Computer Science Dept., 

Univ. of California, Irvine, CA 92717, 
phone (714) 725-2259, fax (714) 856-4056, 
e-mail rsharp@ics.uci.edu or bonakdar@ 
ics.uci.edu. 

Fuzz IEEE 92, Int’l Conf. on Fuzzy Sys¬ 
tems, Mar. 8-12, San Diego, Calif. Sponsor: 
IEEE Neural Network Council. Contact 
Fuzz IEEE 92, 5665 Oberlin Dr., Suite 110, 
San Diego, CA 92121, phone (619) 453- 
6222, fax (619) 535-3880. 

ISIF 92, Fourth Int'l Symp. on Integrated 
Ferroelectrics, Mar. 9-11, Monterey, Calif. 
Contact Alona S. Miller, ISIF 92, Micro¬ 
electronics Research Labs, Univ. of Colo¬ 
rado at Colorado Springs, PO Box 7150, 
Colorado Springs, CO 80933-7150, phone 
(719) 593-3488, fax (719) 594-4257. 


In the accompanying Calendar and adjoining Call for Papers, the IEEE 
Computer Society logo identifies the conferences the society is sponsoring 
or cooperating in. Other conferences of interest to our readers are also listed. 

The free notices are published in chronological order and as space permits. 

For inclusion in the Call for Papers section, please submit the name of the 
event, date(s), location, sponsor(s), deadline for submittals, and phone and fax 
numbers and — if applicable — the electronic address of the person to whom pa¬ 
pers should be sent. 

For the Calendar section, please provide the event name, date(s), location, 
sponsor(s), and phone and fax numbers and — again, if applicable — the elec¬ 
tronic address of the person to contact for complete information. 

Submitted information should arrive at Computer at least five weeks before 
the month of publication (i.e., for the April 1992 issue, send information for re¬ 
ceipt by February 20, 1992) to Chuck Governale, Calendar Dept., Computer, PO 
Box 3014, Los Alamitos, CA 90720-1264, fax (714) 821-4010, e-mail 
c.governale@compmail.com. 


CAD and Eng. Workstations 92 and Busi¬ 
ness Graphics 92, Mar. 9-12, Anaheim, 
Calif. Sponsor: National Computer Graph¬ 
ics Assoc., Contact NCGA, 2722 Merrilee 
Dr., Suite 200, Fairfax, VA 22031-4499, 
phone (800) 225-NCGA or (703) 698-9600, 
fax (703) 204-4521. 


Fifth Virus and Security Conf., Mar. 
11-13, New York City. Sponsor: Data 
Processing Management Assoc. Financial 
Industries. Contact Judy S. Brand, PO Box 
6313 FDR Station, New York, NY 10150, 
phone (800) 835-2246 ext. 190, fax (303) 
825-9151. 


Third Int’l Conf. on Data and 
Knowledge Systems for Manufactur¬ 
ing Eng., Mar. 9-13, Villeurbanne, France. 
Cosponsors: Assoc. Francaise pour la Cy- 
bernetique Economique et Technique et 
al. Contact INSA, 20 Av. Albert Einstein, 
69621 Villeurbanne, France, phone 33 (72) 
438-172, fax 33 (72) 440-800. 


Int’l Conf. on Computer Applications in 
Design, Simulation, and Analysis, Mar. 11- 

13, Orlando, Florida. Contact G.K.F. Lee, 
Mars Mission Research Center, PO Box 
7910, North Carolina State Univ., Raleigh, 
NC 27695-7910, phone (919) 737-2365, fax 
(919) 737-7968, e-mail glee@maeps0.ncsu. 


Fourth Int’l Conf. on Strategic Soft- 
ware Systems, Mar. 10-11, Hunts¬ 
ville, Ala. Cosponsors: IEEE Computer 
Soc. Huntsville Chapter, Univ. of Alabama 
in Huntsville. Contact Ann H. Yelle, Univ. 
of Alabama in Huntsville, Continuing Edu¬ 
cation Div., Conferences (SSS-92), Bevill 
Center 289-A, Huntsville, AL 35899, 
phone (800) 448-4035 or (205) 895-6372, 
fax (205) 895-6934 or 6760. 

ISQE 92, Int’l Software Quality Exchange, 
Mar. 10-11, San Francisco. Sponsor: Juran 
Inst. Contact Brian T. Eck, Juran Inst., 11 
River Rd., PO Box 811, Wilton, CT 06897- 
0811, phone (800) 338-7726. 

Southcon 92, Mar. 10-12, Orlando, Fla. Co¬ 
sponsors: IEEE, Electronic Representa¬ 
tives Assoc. Contact Southcon 92, PO Box 
92275 Worldway Postal Center, Los Ange¬ 
les, CA 90009-2275, phone (800) 877-2668, 
fax (310) 641-5117. 

Lyon Conf., Mar. 10-12, Lyon, France. 
Contact Solange Dubeauclard, 1030 N. 
Glenhurst, Birmingham, MI 48009, phone 
(313) 647-7833. 


Hannover Fair Cebit 92, Mar. 11-18, Han¬ 
nover, Germany. Contact Hannover Fairs 
USA, 103 Carnegie Center, Princeton, NJ 
08540, phone (609) 987-1202. 


CPAL 2, Second Regional Workshop on 
Computer Processing of Asian Languages, 
Mar. 12-16, Kanpur, India. Cosponsors: In¬ 
dian Inst, of Tech, et al. Contact R.M.K. 
Sinha, Computer Science and Eng. Dept., 
IIT, Kanpur 208 016, India, e-mail rmk@ 
kalyan.emet.in. 


Symp. on Document Analysis and Infor¬ 
mation Retrieval, Mar. 16-18, Las Vegas, 
Nev. Contact William L. Brogan, Electrical 
Eng. Dept., Univ. of Nevada at Las Vegas, 
4505 Maryland Pkwy., Las Vegas, NV 
89154, phone (702) 597-4184, e-mail 
eewlb@cs.unlv.edu; or Kazem Taghva, 
Computer Science Dept., UNLV, 4505 
Maryland Pkwy., Las Vegas, NV 89154, 
phone (702) 739-3338, e-mail taghva@cs. 
unlv.edu. 


® EDAC 92, European Conf. on De¬ 
sign Automation, Mar. 16-19, Brus¬ 
sels. Cosponsors: EDAC Assoc, et al. Con- 
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tact Herman Bcke, EDO Abdisstraat 34, 
3030 Leuven-Heverlee, Belgium, phone 32 
(16) 20-3063; or EDAC 92 Secretariat, 
CEP Consultants, 26-28 Albany St., Edin¬ 
burgh EH1 3QH, UK, phone 44 (31) 557- 
2478, fax 44 (31) 557-5749. 


Int’l 1992 Zurich Seminar on Digital Com¬ 
munications, Mar. 16-19, Zurich, Switzer¬ 
land. Sponsor IEEE Switzerland Chapter 
on Digital Comm. Contact Walter Schle- 
gel, Ascom Tech, Freiburgstr. 370, CH- 
3018 Berne, Switzerland, phone 41 (31) 
632-136, fax 41 (31) 555-211, e-mail 
schlegel@tech.ascom.ch. 

Enterprise Networking Event 92, Mar. 17- 

19, Washington, DC. Sponsors: Soc. of 
Manufacturing Engineers. Contact SME, 1 
SME Dr., PO Box 930, Dearborn, MI 
48121-0930, phone (800) 744-4SME or 
(313) 271-1500, fax (313) 271-2861. 

(frflv Packaging, Interconnects, Optoelec- 
Ironies for the Design of Parallel 
Computers, Mar. 17-18, Schaumberg, Ill. 
Sponsor: IEEE Lasers and Electro-Optics 
Soc. Contact Jose Schuh-Aine or Raj Mit- 
tra, Univ. of Illinois, 1406 W. Green St., 
Urbana, IL 61801-2991, phone (217) 244- 
7279, fax (217) 333-8986. 


(Qj) IEEE Multi-Chip Module Conf., 
nS?' Mar. 17-20, Santa Cruz, Calif. Co¬ 
sponsors: IEEE Circuits and Systems Soc. 
et al. Contact Wayne Wei-Ming Dai, 313A 
Applied Sciences Bldg., Computer Eng. 
Dept., Univ. of California, Santa Cruz, CA 
95064, phone (408) 459-4234, fax (408) 
459-4829. 


(qj) Built-In Self-Test Workshop, Mar. 

18-20, Charleston, S.C. Sponsor: 
IEEE Computer Soc. Technical Commit¬ 
tee on Test Tech. Contact Richard Sed- 
mak, Self-Test Services, 6 Lindenwold 
Terr. Ambler, PA 19002, phone (215) 628- 
9700, fax (215) 628-2106. 


Second Conf. on Computers, Freedom, 
and Privacy, Mar. 18-20, Washington, DC. 
Cosponsors: ACM et al. Contact Confer¬ 
ences and Institutes Office, George Wash¬ 
ington Univ., Washington, DC 20052, 
phone (202) 994-0723, fax (202) 994-7048, 
e-mail cfp2@seas.gwu.edu. 


TAU 92, Int’l Workshop on Timing Issues 
in the Specification and Synthesis of Digi¬ 
tal Systems, Mar. 18-20, Princeton, N.J. 
Sponsor: ACM. Contact Sharad Malick, 
Princeton Univ., Electrical Eng. Dept., 
Princeton, NJ 08544, phone (609) 258-4625, 
e-mail sharad@princeton.edu. 


Seventh Int'l Conf. on Tech, and Persons 
with Disabilities, Mar. 18-21, Los Angeles. 
Sponsor: California State Univ., North- 
ridge. Contact Disabled Student Services 
Office, CSUN, 18111 Nordhoff St. — 
DVSS, Northridge, CA 91330, phone (818) 
885-4929 or (818) 885-2578. 


Fourth Oregon Workshop on Software 
Metrics, Mar. 22-24, Portland, Ore. Co¬ 


sponsors: Portland State Univ., Oregon 
Center for Advanced Tech. Education. 
Contact Warren Harrison, Center for Soft¬ 
ware Quality Research, Computer Science 
Dept., Portland State Univ., PO Box 751, 
Portland, OR 97207-0751, phone (503) 
725-3108, fax (503) 725-3211, e-mail 
warren@cs.pdx.edu. 

EWPC 92, European Workshop on Paral¬ 
lel Computing, Mar. 23-24, Barcelona, 
Spain. Sponsor: Commission of European 
Communities. Contact EWPC 92 Secretari¬ 
at, c/o W. Joosen, Departement Computer- 
wetenschappen, K.U. Leuven, Celestijnen- 
laan 200 A, B-3001 Heverlee-Leuven, 
Belgium, phone (32) 1620-1015, fax (32) 
1620-5308, e-mail ewpc92@cs.kuleuven. 
ac.be. 


IPPS 92, Sixth Int'l Parallel Process- 
'^5?' ing Symp., Mar. 23-26, Beverly Hills, 
Calif. Contact Larry H. Canter, Computer 
Systems Approach, 1140 S. Raymond Ave., 
Suite B, Fullerton, CA 92631, phone (714) 
738-3414, fax (714) 738-3470. 


DB/Expo 92 Conf., Mar. 23-26, San Fran¬ 
cisco. Sponsor: NDN Enterprises. Contact 
NDN, 289 S. San Antonio Rd., #204, Los 
Altos, CA 94022, phone (800) 2DB-EXPO 
or (415) 941-8440, fax (415) 941-2066. 


I flll EDBT 92, Int’l Conf. on Extending 
Database Tech., Mar. 23-27, Vienna. 
Sponsor: IEEE. Contact Brigitte Haber- 
stroh, Inst. 184-2, Tech. Univ. of Vienna, 
A-1040 Vienna, Austria, phone 43 (1) 588- 
01/6122, fax 43 (1) 505-5304, e-mail 
haber@vexpert.dbai.tuwien.ac.at. 


1992 Conf. on Optically Based Methods 
for Process Analysis, Mar. 23-27, Somer¬ 
set, N.J. Sponsor: Int’l Soc. for Optical 
Eng. Contact SPIE, PO Box 10, Belling¬ 
ham, WA 98227-0010, phone (206) 676- 
3290, fax (206) 647-1445; SPIE, Xantner 
Strasse 22, D-1000 Berlin 15, Germany, 
phone (49) 228-219062; or SPIE, c/o OTO 
Research, Takeuchi Bldg., 1-34-12 Taka- 
tanobaba, Shinjuku-ku, Tokyo 160, Japan, 
phone 81 (03) 3208-7821, fax 81 (03) 3200- 
2889. 


DCC 92, Data Compression Conf., 
Mar. 24-26, Snowbird, Utah. Spon¬ 
sors: IEEE Computer Soc. Technical Com¬ 
mittee on Computer Comm., NASA. Con¬ 
tact Martin Cohn, Computer Science 
Dept., Brandeis Univ., Waltham, MA 
02554, phone (617) 736-2705, fax (617) 
736-2741, e-mail marty@cs.brandeis.edu. 

1992 Computer-Based Systems Eng. 
N!?' Workshop, Mar. 24-26, Washington, 
DC. Sponsor: IEEE Computer Soc. Task 
Force on Computer-Based Systems Eng. 
Contact Ashok Agrawala or Phil Hwang, 
Computer Science Dept., Univ. of Mary¬ 
land, College Park, MD 20742. 


Science Dept., Box 1910, Brown Univ., 
Providence, RI 02912, phone (401) 863- 
7643, fax (401) 863-7657, e-mail dpl@cs. 
brown.edu. 


lyTO SEDMS 92, Third Symp. on Experi- 
ences with Distributed and Multipro¬ 
cessor Systems, Mar. 26-27, Newport 
Beach, Calif. Sponsor: Usenix Assoc. Con¬ 
tact George Leach, AT&T Paradyne, MS 
LG-133, PO Box 2826, Largo, FL 34649- 
2826, phone (813) 530-2376, fax (813) 530- 
8224, e-mail reggie@pdn.paradyne.com. 

(frK) ASOM, Second Am. Symp. on Mi- 
croelectronics, Mar. 27-28, Memphis, 
Tenn. Sponsor: Memphis State Univ. Con¬ 
tact Eliayeb S. Abuelyaman, Electrical 
Eng. Dept., Memphis State Univ., Mem¬ 
phis, TN 38152, phone (901) 678-2050, fax 
(901) 678-4180. 


SETSS 92, Eighth Int’l Conf. on Software 
Eng. for Telecommunication Systems and 
Services, Mar. 30-Apr. 1, Florence, Italy. 
Sponsor: Institution of Electrical Engi¬ 
neers. Contact IEE Conf. Services, Savoy 
Place, London WC2R 0BL, UK, phone 
(071) 240-1871, fax (071) 240-7735. 


TOOLS Europe 92, Tech, of Object- 
Oriented Languages and Systems Int’l 
Conf., Mar. 30-Apr. 2, Dortmund, Germa¬ 
ny. Cosponsors: Societe des Outils du Lo- 
giciel and Georg Heeg. Contact Boris Mag- 
nusson, Lund Univ. Computer Science 
Dept., PO Box 118, S-22100 Lund, Sweden, 
fax (46) 4613-1021, e-mail boris@dna. 
lth.se; or SOL, 14 rue Jean Rey, 75015 
Paris, France, phone 33 (1) 4056-0358, fax 
33 (1) 4056-0581. 


April 1992 

^ PCCC 92,11th IEEE Int’l Phoenix 
vU' Conf. on Computers and Communi¬ 
cations, Apr. 1-3, Scottsdale, Ariz. Cospon¬ 
sors: IEEE Comm. Soc. et al. Contact 
Ralph Martinez, Univ. of Arizona, Electri¬ 
cal and Computer Eng. Dept., Tucson, AZ 
85721, phone (602) 621-6174, e-mail 
martinez%ecevax@rvax.ccit.arizona.edu; 
or Joseph Urban, Computer Science and 
Eng. Dept., Arizona State Univ., Tyler 
Mall-ECG 252, Tempe, AZ 85287-5406, 
phone (602) 965-2774, fax (602) 965-2751, 
e-mailjurban@asuvax.eas.asu.edu. 

Third Conf. on Applied Natural Language 
Processing, Apr. 1-3, Trento, Italy. Spon¬ 
sor: Assoc, for Computational Linguistics. 
Contact Oliviero Stock, IRST, 38050 Povo, 
Trento, Italy, phone 39 (461) 814-444, fax 
39 (461) 810-851, e-mail stock@irst.it; or 
Madeleine Bates, BBN Systems and Tech¬ 
nologies, 10 Moulton St., Cambridge, MA 
02138, phone (617) 873-3634, fax (617) 
873-3776, e-mail bates@bbn.com. 


1992 Conf. on Advanced Research in VLSI 
and Parallel Systems, Mar. 25-27, Provi¬ 
dence, R.I. Cosponsors: Brown Univ., 

MIT. Contact Daniel Lopresti, Computer 


SIGCPR 92, Apr. 5-7, Cincinnati, Ohio. 
Sponsor: ACM Special Interest Group on 
Computer Personnel Research. Contact 
Thomas Ferratt, Univ. of Dayton, Decision 
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Sciences Dept. 300 College Park, Dayton, 
OH 45469, phone (513) 229-2728, e-mail 
ferratt@dayton. 

Conf. on Programming Environments for 
Parallel Computing, Apr. 6-8, Edinburgh, 
Scotland. Cosponsors: Int’l Federation for 
Information Processing et al. Contact 
Rosemary Candlin, Computer Science 
Dept., Univ. of Edinburgh, King’s Bldgs., 
Edinburgh, UK EH9 3JZ, phone 44 (31) 
650-5137, fax 44 (31) 667-7209, e-mail 
rc@cs.ed.ac.uk. 


25th Simulation Symp., Apr. 6-9, Or- 

lando, Fla. Sponsor: Soc. for Com¬ 
puter Simulation. Contact Tom Kubiak, 
EDS/CPC HQ Rm. 228-25, 30601 Van 
Dyke, Warren, MI 48090-9020, phone 
(313) 575-0886, fax (313) 575-0966. 


Latin 92, Int’l Symp. on Latin Am. Theo¬ 
retical Informatics, Apr. 6-10, Sao Paulo, 
Brazil. Cosponsors: ACM et al. Contact 
Paulo Feofiloff, Univ. de Sao Paulo, Inst, 
de Matematica e Estatisticas, Caixa Postal 
20570, phone 55 (11) 813-9499, e-mail 
pfeofilo@brusp.bitnet. 


10th IEEE VLSI Test Symp., Apr. 7- 

vjy 9, Atlantic City, N.J. Sponsor: IEEE 
Computer Soc. Technical Committee on 
Test Tech. Contact IEEE Computer Soc., 
1730 Massachusetts Ave. NW, Washington, 
DC 20036-1903, phone (202) 371-1013, fax 
(202) 728-0884; or Dhiraj Pradham, Univ. 
of Massachusetts, ECE Dept., Marcus 
Hall, Amherst, MA 01003, phone (413) 
545-0160, fax (413) 545-4611. 


Fourth Int'l Conf. on Image Processing 
and its Applications, Apr. 7-9, Maastricht, 
The Netherlands. Sponsor: Institution of 
Electrical Engineers. Contact IEE Conf. 
Services, Savoy Place, London WC2R 0BL, 
UK, phone (071) 240-1871, fax (071) 240- 
7735. 


Flairs 92, Fifth Florida Al Research Symp. 
Apr. 7-10, Ft. Lauderdale, Fla. Contact 
Fred Hoffman, Math Dept., Florida Atlan¬ 
tic Univ., Boca Raton, FL 33431, phone 
(407) 367-3345, e-mail hoffman@servax. 
bitnet. 


Third Workshop on Future Trends in 
Distributed Computing, Apr. 14-16, 

Taipei, Taiwan. Sponsor: IEEE Computer 
Soc. Technical Committee on Distributed 
Computing. Contact Stephen S. Yau, Univ. 
of Florida, Computer and Information Sci¬ 
ences Dept., 301 CSE Bldg., Gainesville, 
PL 32611-2024, phone (904) 392-1212, fax 
(904) 392-1220. 


NATUG, North Am. Transputer Users 
Group 1992 Spring Meeting, Apr. 19-21, 

Baltimore, Md. Contact David L. Fielding, 
Advanced Computing Research Inst., Cor¬ 
nell Univ., 705 Theory Center Bldg., 
Ithaca, NY 14853-3801, phone (607) 254- 
8872, fax (607) 254-8888. 


ICCL 92, Int’l Conf. on Computer 
Languages, Apr. 20-23, San Fran¬ 
cisco. Sponsor: IEEE Computer Soc. Tech¬ 
nical Committee on Computer Languages. 
Contact Mario Barbacci, Software Eng. 
Inst., Carnegie Mellon Univ., Pittsburgh, 
PA 15213, phone (412) 268-7704, fax (412) 
268-5758, e-mail barbacci@sei.cmu.edu. 

(£f^j 15th IEEE Workshop on Design for 
Testability, Apr. 21-24, Vail, Colo. 
Contact T.W. Williams, IBM, PO Box 
1900, 67A/021, Boulder, CO 80301-9191, 
phone (303) 924-7692. 


Seventh Symp. on Microcomputer and Mi¬ 
croprocessor Applications, Apr. 22-24, 

Budapest, Hungary. Cosponsors: Scientific 
Soc. for Telecommunication et al. Contact 
SST, Budapest 5. Pf. 451., Hungary, H- 
1372, phone (361) 153-1027, fax (361) 153- 
0451. 


Third Workshop on Workstation Op- 
erating Systems, Apr. 23-24, Key 

Biscayne, Fla. Sponsor: IEEE Computer 
Soc. Technical Committee on Operating 
Systems and Applications Environments. 
Contact Joseph Boykin, 7 Hampton Rd., 
Natick, MA 01760, phone (508) 651-8228. 


SHPCC 92, Scalable High-Performance 
Computing Conf., Apr. 26-29, Williams¬ 
burg, Va. Contact Emily Todd, ICASE, 
MS 312-C, NASA Langley Research Cen¬ 
ter, Hampton, VA 23665, e-mail emily@ 
icase.edu. 


EWDC 4, Fourth European Workshop on 
Dependable Computing, Apr. 8-10, 

Prague, Czechoslovakia. Cosponsors: Ge- 
sellschaft fur Informatik et al. Contact 
Karama Kanoun,LAAS-CNRS, 7, Avenue 
du Colonel Roche, 31 077 Toulouse Cedex, 
France, fax (33) 6133-6411, e-mail 
kanoun@laas.fr; or Francesca Saglietti, 
GRS, Abteilung Prozebrechner, For- 
schungsgelande, 8046 Garching, Germany, 
fax (49) 89-3200-4300. 


23rd Pittsburgh Conf. on Modeling and 
Simulation, Apr. 30-May 1, Pittsburgh. Co¬ 
sponsors: Univ. of Pittsburgh, IEEE et al. 
Contact William G. Vogt or Marlin H. 
Mickle, 348 Benedum Eng. Hall, Univ. of 
Pittsburgh, Pittsburgh, PA 15261. 
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Working Conf. on Information System 
Concepts, Apr. 13-15, Alexandria, Egypt. 
Sponsor: Int’l Federation for Information 
Processing. Contact Eckhard D. Falken- 
berg, Information Systems Dept., Univ. of 
Nijmegen, Toernooiveld 1, NL-6525 ED 
Nijmegen, The Netherlands, fax 31 (80) 
553-450, e-mail ef@cs.kun.nl. 


DIAC 92, Directions and Implications of 
Advanced Computing, May 2-3, Berkeley, 
Calif. Sponsor: Computer Professionals for 
Social Responsibility. Contact Douglas 
Schuler, Boeing Computer Services, MS 
7L-64, PO 24346, Seattle, WA 98124-0346, 
home phone (206) 632-1659, office phone 


(206) 865-3832, fax (206) 865-2964, e-mail 
dschuler@june.cs.washington.edu. 


CHI 92, Conf. on Human Factors in 
Computing, May 3-7, Monterey, 
Calif. Sponsor: ACM. Contact Assoc, for 
Computing Machinery, 11 W. 42nd St., 
New York, NY 10036, phone (212) 869- 
7440. 


1992 IEEE Symp. on Research in Se- 
curity and Privacy, May 4-6, Oak¬ 
land, Calif. Sponsor: IEEE Computer Soc. 
Technical Committee on Security and Pri¬ 
vacy. Contact Deborah Cooper, Unisys, 
5731 Slauson Ave., Culver City, CA 90230, 
phone (310) 338-3727, e-mail cooper@ 
culv.unisys.com. 


ICOOMS 92, Int’l Conf. on Object-Orient¬ 
ed Manufacturing Systems, May 4-6, Cal¬ 
gary, Alta., Canada. Cosponsors: Soc. for 
Computer Simulation, Univ. of Calgary. 
Contact Douglas Norrie, Manufacturing 
Eng. Div., Eng. Faculty, Univ. of Calgary, 
2500 University Dr. NW, Calgary, Alta. 
T2N 1N4, Canada, phone (403) 220-5787, 
fax (403) 282-8406. 


(ffil IEEE Infocom 92,11th Conf. on 
Computer Comm., May 4-8, Flo¬ 
rence, Italy. Cosponsor: IEEE Comm. Soc. 
Contact L. Fratta, Politecnico di Milano, 
c/o Cefriel, Via Emanueli, 15, 20126 Mila¬ 
no, Italy, phone 39 (2) 2399-3578, fax 39 
(2) 2399-3587, e-mail fratta@imicefr.bitnet; 
or J. Kurose, Computer and Information 
Science Dept., Univ. of Massachusetts, 
Amherst, MA 01003, phone (413) 545- 
1585, e-mail kurose@cs.umass.edu. 


Comp Euro 92, IEEE Int’l Conf. on 
V5Z Computer Systems and Software 
Eng., May 4-8, The Hague, The Nether¬ 
lands. Cosponsors: IEEE Region 8, IEEE 
Benelux Section. Contact Patrick Dewilde, 
Delft Univ. of Tech., EE Dept., POB 5031, 
2600 GA Delft, The Netherlands, fax 31 
(15) 623-271; or G. Arink, Philips Medical 
Systems, PO Box 10000, 5680 DA Best, 
The Netherlands, phone 31 (40) 762-060, 
fax 31 (40) 762-952. 


NASA 92, Seventh Goddard Conf. on 
Space Applications of Artificial Intelli¬ 
gence, May 5-6, Greenbelt, Md. Sponsor: 
NASA Goddard Space Flight Center. Con¬ 
tact Jonathan Hartley, NASA GSFC, Code 
522, Greenbelt, MD 20771, phone (301) 
286-3150.- - 


PTW 92, Second Pacific Test Work- 
shop, May 5-8, Vancouver, B.C., 
Canada. Sponsor: IEEE Computer Soc. 
Technical Committee on Test Tech. Con¬ 
tact Andre Ivanov, Electrical Eng. Dept., 
Univ. of British Columbia, 2356 Mian 
Mall, Vancouver, B.C. V6T 1Z4, Canada, 
phone (604) 822-6936, fax (604) 822-5949; 
or Mani Soma, phone (206) 685-3810, fax 
(206) 543-3842. 


1992 IEEE Int’l Conf. on Robotics and 
Automation, May 10-15, Nice, France. 
Sponsor: IEEE Robotics and Automation 
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Soc. Contact Harry Hayman, PO Box 3216, 
Silver Spring, MD 20918, phone (301) 236- 
5621, fax (301) 236-5621. 

((jj) ICSE 92,14th Int’l Conf. on Soft- 
ware Eng., May 11-15, Melbourne, 
Australia. Cosponsors: IEEE Computer 
Soc. Technical Committee on Software 
Eng. et al. Contact IEEE Computer Soc., 
1730 Massachusetts Ave. NW, Washington, 
DC 20036-1903, phone (202) 371-1013, fax 
(202) 728-0884; or A.Y. Montgomery, 
Computer Science Dept., Royal Mel¬ 
bourne Inst, of Tech., PO Box 2476V, Mel¬ 
bourne 3001, Victoria, Australia, phone 61 
(3) 660-2943, fax 61 (3) 662-1617, e-mail 
aym@goanna.cs.rmit.oz.au. 

(Qjh Ninth IEEE Workshop on Real- 
vS?' Time Operating Systems and Soft¬ 
ware, May 13-14, Pittsburgh. Cosponsors: 
IEEE Computer Soc. Technical Commit¬ 
tee on Real-Time Systems, Office of Naval 
Research. Contact Marc Donner, IBM Re¬ 
search, PO Box 218, Yorktown Heights. 

NY 10598, phone (914) 784-7508 or (914) 
945-2032, fax (914) 945-1234, e-mail 
donner@watson.ibm.com. 

SID 92, Int'l Symp. and Seminar of the 
Soc. for Information Display, May 17-22, 

Boston. Contact Paul M. Alt, IBM T.J. 
Watson Research Center, PO Box 218, 
Yorktown Heights, NY 10598, phone (914) 
945-2437, fax (914) 945-1974, e-mail alt@ 
watson.ibm.com; Joyce Farrell, Hewlett- 
Packard Labs, PO Box 10490, Palo Alto, 

CA 94303-0969, phone (415) 857-2807, fax 
(415) 857-4320, e-mail farrell@hplabs.hp. 
com; or Jay Morreale, Palisades Inst, for 
Research Services, 201 Varick St., Suite 
1140, New York, NY 10014, phone (212) 
620-3371, fax (212) 620-3379. 

(fW) Workshop on Interconnections with- 
'SZ' in High-Speed Digital Systems, May 
18-20, Santa Fe, N.M. Sponsor: IEEE La¬ 
sers and Electro-Optics Soc. Contact Ken 
Young, Bellcore, 445 South St., Rm. 2Q- 
150, Morristown, NY 07962-1910. 


IEEE Int’l Symp. on Industrial Electron¬ 
ics, May 25-29, Xian, China. Cosponsors: 
IEEE Industrial Electronics Soc. et al. 
Contact Allen C. Chen, AT&T Bell Labs, 
Rm. 1E134A, Whippany Rd., PO Box 903, 
Whippany Road, NJ 07981-0903; J.D. Ir¬ 
win, Electrical Eng. Dept., Auburn Univ., 
Auburn, AL 36849, phone (205) 844-1810, 
fax (205) 844-1809; or Sheng-wu Liu, Li¬ 
brary, Northwestern Polytechnical Univ., 
Xian 710071, China, phone 1 (29) 533-71 
ext. 2926, fax 1 (29) 711-959. 

/Qjj) Israeli Symp. on the Theory of Com- 

puting and Systems, May 27-28, Hai¬ 
fa, Israel. Cosponsors: Assoc, of Techno¬ 
logical Education in Electronics and Com¬ 
puter Science et al. Contact Michael 
Rodeh, Science and Tech., IBM Israel, 
Technion City, Haifa 32000, Israel, phone 
972 (4) 296-205, fax 972 (4) 320-894. 

MVL 92, 22nd Int’l Symp. on Multi- 
vi?' ple-Valued Logic, May 27-29, Sendai, 
Japan. Sponsors: IEEE Computer Soc. 
Technical Committee on Multiple-Valued 
Logic, Japan Research Group on Multiple- 
Valued Logic. Contact Tatsuo Higuchi, 
Electronic Eng. Dept., Tohoku Univ., 
Aoba, Aramaki, Sendai 980, Japan, phone 
81 (022) 222-1800, fax 81 (022) 263-9406, 
e-mail thiguchi@higuchi.ecei.tohoku.ac.jp; 
or S.B. Silio, Electrical Eng. Dept., Univ. 
of Maryland, College Park, MD 20742, 
phone (301) 454-6839, fax (301) 454-1837, 
e-mail silio@eng.umd.edu. 

Symp. on Assessment of Quality 

Software Development Tools, May 
27-29, New Orleans. Sponsor: Tulane 
Univ. Contact Judy Lee, IBM, 1000 NW 
51st. St., Boca Raton, FL 33432, phone 
(407) 982-1048; or Ez Nahouraii, IBM 
(798/089), 6321 San Ignacio Ave., San Jose, 
CA 95119, phone (408) 281-5741, e-mail 
eznah@stlvm7.iinusl .ibm.com. 

ICCI, Fourth Conf. on Computing 

and Information, May 28-30, Toron¬ 
to. Contact Peter E. Lauer, Computer Sci¬ 
ence and Systems, McMaster Univ., Hamil¬ 
ton, Ont., Canada, phone (416) 525-9140. 


lyre ISCA 92,19th Int’l Symp. on Com- 
'N&Z puter Architecture, May 19-21, 

Queensland, Australia. Cosponsors: IEEE, 

ACM SIGArch. Contact Jean-Lue Gaudi- June 1992 
ot, EE-Systems Dept., SAL 300, Univ. of 
Southern California, Los Angeles, CA 
90089-0781, phone (213) 740-4484 or (213) 

743-0249, fax (213) 740-4449, e-mail 
gaudiot@usc.edu or gaudiot@priam.usc. 
edu; or D. Abramson, CSIRO, 723 Swan- 
ston St., Melbourne 3000, Australia, phone 
61 (3) 282-2666, e-mail david.abramson@ 
mel.dit.csiro.au. 


FGCS 92, Int’l Conf. on Fifth-Gener- 
ation Computer Systems, June 1-5, 

Tokyo. Cosponsors: Information Process¬ 
ing Soc. of Japan et al. Contact Hidehiko 
Tanaka, Univ. of Tokyo, 3-1 Hongo 7- 
chome, Bunkyo-ku, Tokyo 113, Japan, 
phone 81 (33) 3812-2111 ext. 6663, fax 81 
(33) 3818-5706. 


OTnl Sixth Int’l Conf. on Computer Sci- 
ence, May 20-22, Tunis, Tunisia. 
Sponsor: Assoc. Francaise pour la Cyber- 
netique Economique et Technique. Con¬ 
tact Montasser Ouaili, Ecole Nationale des 
Sciences de l’lnformatique, 16 Rue 8010 
Quartier Montplaisir, 1002 Tunis Belve¬ 
dere, Tunisia, phone 216 (1) 784-032, fax 
216 (1) 787-827. 


yra Euro ASIC 92, June 1-5, Paris. Con- 
tact Gabriele Saucier, Laboratoire 
CSI/Institut National, Polytechnic de 
Grenoble, 46 Ave. Felix Viallet/38000 
Grenoble, France, phone (33) 76-574-687, 
fax (33) 76-503-421. 

® Sixth Israel Conf. on Computer Sys¬ 
tems and Software Eng., June 2-3, 


Herzliya, Israel. Sponsors: IEEE Comput¬ 
er Soc. Israel Chapter, Information Pro¬ 
cessing Assoc, of Israel. Contact Confer¬ 
ence Secretariat, Ortra, PO Box 50432, Tel 
Aviv 61500, Israel, phone 972 (3) 664-825, 
fax 972 (3) 660-952. 

/ffi) European Design for Testability 
Conf., June 2-4, Brugge, Belgium. 
Contact Thomas W. Williams, phone (303) 
924-9912; or J. Jamison, phone 32 (2) 118- 
7511. 

COGANN, Combination of Genetic Algo¬ 
rithms and Neural Networks Workshop, 
June 6, Baltimore. Sponsor: IEEE Neural 
Networks Council. Contact Darrell Whit¬ 
ley, Computer Science Dept., Colorado 
State Univ., Fort Collins, CO 80523, phone 
(303) 491-5373, e-mail whitley@cs. 
colostate.edu. 

IJCNN 92, Int’l Joint Conf. on Neural Net¬ 
works, June 7-11, Baltimore. Cosponsors: 
IEEE, Int’l Neural Network Soc. Contact 
IJCNN 92, 5665 Oberlin Dr. #110, San Di¬ 
ego, CA 92121, phone (619) 453-6222, fax 
(619) 535-3880. 

DAC 92,29th IEEE/ACM Design 
Automation Conf., June 8-12, Ana¬ 
heim, Calif. Contact DAC 92, MP Associ¬ 
ates, 7490 Clubhouse Rd., Boulder, CO 
80301, phone (303) 530-4333; or Dan 
Schweikert, Cadence Design Systems, 555 
River Oaks Pkwy., Bldg. 4, San Jose, CA 
95132, phone (408) 944-7297, e-mail dan@ 
cadence.com. 

® Sixth Int’l Workshop on Scientific 
and Statistical Database Manage¬ 
ment, June 9-11, Zurich, Switzerland. Co¬ 
sponsor: ACM SIGMOD. Contact H. 
Hinterberger, Inst, for Scientific Comput¬ 
ing, ETH Zentrum, CH-8093 Zurich, Swit¬ 
zerland, phone 41 (1) 254-7436, fax 41 (1) 
262-3973. 

(jgjj) IEA/AIE 92, Fifth Int’l Conf. on In- 
v!?' dustrial and Eng. Applications of Ar¬ 
tificial Intelligence and Expert Systems, 
June 9-12, Paderborn, Germany. Cospon¬ 
sors: Univ. of Paderborn et al. Contact 
Fevzi Belli, Univ. of Paderborn, Postfash 
1621, 4790 Paderborn, Germany, phone 49 
(5251) 60-3282, fax 49 (5251) 602-519, 
e-mail belli@adt.uni-paderborn.de. 

ICDCS 92,12th Int’l Conf. on Dis- 
tributed Computing Systems, June 9- 

12, Yokohama, Japan. Cosponsor: Infor¬ 
mation Processing Soc. of Japan. Contact 
Ming T. (Mike) Liu, Computer and Infor¬ 
mation Science Dept., Ohio State Univ., 
2036 Neil Ave., Columbus, OH 43210- 
1277, phone (614) 292-6552, fax (614) 292- 
9021, e-mail mike.liu@osu.edu; or Yutaka 
Matsushita, Instrumentation Eng. Dept., 
Keio Univ., 3-14-1 Hiyoshi, Kohoku-ku, 
Yokohama, Japan 223, phone 81 (045) 563- 
1141 ext. 3564, fax 81 (045) 562-7625, 
e-mail on@inst.keio.ac.jp. 

® ITS 92, Second Int’l Conf. on Intelli¬ 
gent Tutoring Systems, June 10-12, 
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Montreal. Cosponsors: Univ. of Montreal 
et al. Contact Claude Frasson, Departe- 
ment d’IRO, Universite de Montreal, CP 
6128 Succ A. Montreal, Que. H3C 3J7, 
Canada, phone (514) 343-7019, fax (514) 
343-5834, e-mail frasson@iro.umontreal.ca. 

CBMS 92, IEEE Symp. on Comput- 
er-Based Medical Systems, June 14- 

17, Durham, N.C. Cosponsors: IEEE Com¬ 
puter Soc., IEEE Eastern North Carolina 
Section, Eng. in Medicine and Biology Soc. 
Contact James N. Brown, Jr., Research 
Triangle Inst., PO Box 12194, Research 
Triangle Park, NC 27709, phone (919) 541- 
6975, fax (919) 541-5945; or Peter Santago, 
Radiology Dept., Bowman Gray School of 
Medicine, Medical Center Blvd., Winston- 
Salem, NC 27157-1022, phone (919) 748- 
4260, fax (919) 748-2870, e-mail cbms@ 
mrips.bgsm.wfu.edu. 

NECC 92,13th Nat’l Educational 
Computing Conf., June 15-17, Dal¬ 
las. Contact Cathy Norris, Computer Edu¬ 
cation and Cognitive Systems, Univ. of 
North Texas, PO Box 5155, UNT Station, 
Denton, TX 76203, phone (817) 565-2824, 
fax (817) 565-4425. 

/£f^) ICSI 92, Second Int’l Conf. on Sys- 
terns Integration, June 15-18, Morris¬ 
town, N.J. Cosponsors: ACM et al. Contact 
Peter A. Ng, Computer and Information 
Science Dept., New Jersey Inst, of Tech., 
University Heights, Newark, NJ 07102, 
phone (201) 596-3387, e-mail ng_p@ 
vienna.njit.edu; or C.V. Ramamoorthy, 
Univ. of California at Berkeley, EECS 
Dept., Berkeley, CA 94720, phone (415) 
642-4751, fax (415) 642-5775. 

CVPR 92, IEEE Computer Society 
Conf. on Computer Vision and Pat¬ 
tern Recognition, June 15-18, Champaign, 
Ill. Contact A. Rosenfeld, Center for Au¬ 
tomation Research, Univ. of Maryland, 
College Park, MD 20742, e-mail ar@alv. 
umd.edu. 

Parle 92, Parallel Architectures and 
Languages Europe. June 15-18, Par¬ 
is. Cosponsors: IEEE Region 8 et al. Con¬ 
tact Daniel Etiemble, LRI, Bat 490, Univ. 
of Paris Sud, 91405 Orsay Cedex, France, 
phone 33 (1) 6941-6621, fax 33 (1) 6941- 
6586, e-mail de@lri.lri.fr. 

Compass 92, Seventh Conf. on Computer 
Assurance, June 15-18, Gaithersburg, Md. 
Cosponsors: IEEE, IEEE Aerospace and 
Electronic Systems Soc. Contact Robert 
Ayers, ARINC Research, 2551 Riva Rd., 
Annapolis, MD 21401, phone (301) 266- 
4741, fax (301) 266-4040. 

Computer Security Foundations 
'kss Workshop V, June 16-18, Franconia, 
N.H. Contact Leonard LaPadula, Informa¬ 
tion Security Technical Center, Mitre 
Corp., Bedford, MA 01730-0208, phone 
(617) 271-3261, e-mail ljl@mitre.org; or 
Ravi Sandhu, ISSE Dept., George Mason 
Univ., Fairfax, VA 22030-4444, phone 
(703) 993-1659, e-mail sandhu@sitevax. 
gmu.edu. 


SEKE 92, Fourth Int’l Conf. on Soft- J u Jy 1992 
ware Eng. and Knowledge Eng., 

June 17-19, Capri, Italy. Cosponsors: Univ. 
of Salerno et al. Contact Shi-Kuo Chang, 

Computer Science Dept., Univ. of Pitts¬ 
burgh, Pittsburgh, PA 15260, phone (412) 

624-8493; or Genoveffa Tortora, Diparti- 
mento di Informatica ed Applicazioni, 

Univ. di Salerno, 1-84081 Baronissi, Saler¬ 
no, Italy, phone 39 (89) 822-333, fax 39 
(81) 482-745, e-mail jentor@udsab.dia. 
unisa.it or usditort@inacriai.bitnet. 


IEEE Intelligent Vehicles 92, July 1-2, 

Michigan. Sponsor: IEEE Industrial Elec¬ 
tronics Soc. Contact Ichiro Masaki, Com¬ 
puter Science Dept., GM Research Labs, 
30500 Mound Rd., Warren, MI 48090-9055, 
phone (313) 986-1466, fax (313) 986-9356, 
e-mail masaki@gmr.com. 


Int’l Workshop on Hardware Fault- 
Tolerance in Multiprocessors, June 
22-23, Amherst, Mass. Sponsor: IEEE 
Computer Soc. Technical Committee on 
Fault-Tolerant Computing. Contact 
Donald S. Fussell, Computer Science 
Dept., TAY 2.124, Univ. of Texas at Aus¬ 
tin, Austin, TX 78712-1188, phone (512) 
471-9719, fax (512) 471-7028, e-mail 
fussell@cs.utexas.edu. 

Seventh IEEE Conf. on Structure in 
Complexity Theory, June 22-25, Bos¬ 
ton. Sponsor: IEEE Computer Soc. Tech¬ 
nical Committee for Math. Foundations of 
Computing. Contact J. Royer, School of 
Computer and Information Science, Syra¬ 
cuse Univ., Syracuse, NY 13244, fax (315) 
443-1954, e-mail royer@top.cis.syr.edu. 

LICS, Seventh IEEE Symp. on Logic 
in Computer Science, June 22-25, 

Santa Cruz, Calif. Sponsor: IEEE Comput¬ 
er Soc. Technical Committee on Mathe¬ 
matical Foundations in Computing. Con¬ 
tact Robert L. Constable, Computer 
Science Dept., Cornell Univ., 4149 Upson, 
Ithaca, NY 14853, phone (607) 255-9204, 
fax (607) 255-4428. 

CGI 92, Computer Graphics Int’l 
Conf., June 22-26, Tokyo. Cospon¬ 
sors: Computer Graphics Soc. et al. Con¬ 
tact R.A. Earnshaw, Computer Graphics, 
Univ. of Leeds, Leeds LS2 9JT, England, 
phone 44 (532) 335-414, fax 44 (532) 336- 
017; or Toki Takahashi, Autonomous Ro¬ 
bot Systems Lab, NTT Human Interface 
Labs, 3-9-11 Midori-cho, Musashino-shi, 
Tokyo 180, Japan, phone 81 (422) 59-2406, 
fax 81 (422) 59-2245, e-mail cgi92@nttarm. 
ntt.jp (Internet). 

® RSP, Third Int’l Workshop on Rapid 
System Prototyping, June 23-25, Re¬ 
search Triangle Park, N.C. Cosponsors: 
IEEE Computer Soc. Technical Commit¬ 
tees on Design Automation, Simulation, 
and Test Tech. Contact Nick Kanopoulos, 
Center for Systems Eng., Research Trian¬ 
gle Inst., 3040 Cornwallis Rd., Research 
Triangle Park, NC 27709, phone (919) 541- 
7341, fax (919) 541-6515, e-mail rsp@rti. 
rti.org. 

I0th Software Reliability Symp., 
June 25-26, Denver. Sponsor: IEEE 
Reliability Soc. Denver Chapter. Contact 
Rich Karcich, Storage Tech. Corp., 2270 S. 
88th St., MS 2286, Louisville. CO 80028- 
5319, phone (303) 673-6223, fax (303) 673- 
3353. 


I ^f^ l 1992 Workshop on Fault-Tolerant 

Parallel and Distributed Systems, 

July 6-7, Amherst, Mass. Cosponsor: Univ. 
of Massachusetts at Amherst. Contact 
Donald S. Fussell, Computer Science 
Dept., Univ. of Texas at Austin, TAY 
2.124, Austin, TX 78712-1188, phone (512) 
471-9719, fax (512) 471-7028, e-mail 
fussell@cs.utexas.edu. 

CASE 92, Fifth Int’l Workshop on 

Computer-Aided Software Eng., July 
6-10, Montreal. Contact Gene Forte, 

CASE Outlook, 11830 Kerr Pkwy. #315, 
Lake Oswego, OR 97035, phone (503) 245- 
6880, fax (503) 245-6935, e-mail g.forte@ 
compmail.com; Nazim Madhavji, School of 
Computer Science, McGill Univ., 3480 
University St., Montreal, Que. H3A 2A7, 
Canada, phone (514) 398-3740, fax (514) 
398-3883, e-mail case@softeng.cs.mcgill.ca; 
or John Jenkins, City Univ. of London, 
Business Systems Analysis Dept., London, 
UK, fax 44 (71) 608-1270. 

Iros 92, Fifth Int’l Conf. on Intelligent Ro¬ 
bots and Systems, July 7-10, Raleigh, N.C. 
Cosponsors: IEEE Industrial Electronics 
Soc., Robotics Soc. of Japan et al. Contact 
Avi Kak, School of Electrical Eng., Purdue 
Univ., West Lafayette, IN 47907, phone 
(317) 494-3551, fax (317) 494-6440, e-mail 
kak@ecn.purdue.edu; or Kazuo Tanie, Me¬ 
chanical Eng. Lab, MITI, 1-2 Namiki, 
Tsukuba-shi, Ibaraki-ken, 305, Japan. 

FTCS 22,22nd Int’l Symp. on Fault- 

Tolerant Computing, July 8-10, Bos¬ 
ton. Cosponsor: Univ. of Massachusetts at 
Amherst. Contact Dhiraj K. Pradhan, 
Electrical and Computer Eng. Dept., Univ. 
of Massachusetts at Amherst, Amherst, 
MA 01003, phone (413) 545-0160, fax (413) 
545-4611, e-mail pradhan@ecs.umass.edu. 


August 1992 

® 17th Congress Workshop, Aug. 2-14, 

Washington, DC. Sponsor: United 
Nations. Contact Lawrence W. Fritz, GE 
Aerospace, PO Box 8048-10A26, Philadel¬ 
phia, PA 19101, phone (215) 531-3205, fax 
(215) 962-3698. 

Hot Chips IV, Symp. on High-Per- 
formance Chips, Aug. 9-11, Stanford, 
Calif. Sponsor: IEEE Computer Soc. Tech¬ 
nical Committee on Microprocessors and 
Microcomputers. Contact Glen Langdon, 
phone (408) 459-2212, e-mail langdon@ 
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CAREER OPPORTUNITIES 


RATES: $15.00 per line (ten lines 
minimum). Average five typeset words 
per line, eight lines per column inch. Add 
$10 for box number. Send copy at least 
one month prior to publication date to: 
Marian B. Tibayan, Classified 
Advertising, COMPUTER Magazine, 
10662 Los Vaqueros Circle, PO Box 
3014, Los Alamitos, CA 90720-1264; 
(714) 821-8380; fax: (714) 821-4010. 

In order to conform to the Age Discrimina¬ 
tion in Employment Act and to discourage 
age discrimination, COMPUTER may re¬ 
ject any advertisement containing any of 
these phrases or similar ones: “...recent 
college grads...,” “...1-4 years maximum 
experience...,” “...up to 5 years experi¬ 
ence,” or “...10 years maximum experi¬ 
ence.” COMPUTER reserves the right to 
append to any advertisement without spe¬ 
cific notice to the advertiser. Experience 
ranges are suggested minimum require¬ 
ments, not maximums. COMPUTER as¬ 
sumes that since advertisers have been 
notified of this policy in advance, they 
agree that any experience requirements, 
whether stated as ranges or otherwise, will 
be construed by the reader as minimum re- 
quirements only. COMPUTER encour¬ 
ages employers to offer salaries that are 
competitive, but occasionally a salary may 
be offered that is significantly below cur¬ 
rently acceptable levels. In such cases the 
reader may wish to inquire of the employer 
whether extenuating circumstances apply. 


SENIOR SYSTEMS ANALYST 
$55,000/yr. 


Duties: Design and implement computer 
network system monitoring tools: design 
data analysis requirements: design and im¬ 
plement experiments for collecting data, and 
analyze data; provide simulation model and 
data for new system development; and con¬ 
duct system optimal design analysis. 

Requirements: Ph.D. in Management 
Information System or Applied Mathe¬ 
matics; 5 years experience in job offered or 5 
years experience in network optimization 
problems and computer software engineer¬ 
ing, using SOL database system; FOR¬ 
TRAN, C and UNIX scripts; and Remote 
Procedure Call and socket-based Inter¬ 
process Communication network program¬ 
ming; and at least 10 publications evidencing 
experience in the field. 


Apply at the Texas Employment Commis¬ 
sion, Dallas, Texas, or send resume to the 
Texas Employment Commission, TEC Build¬ 
ing, Austin. Texas 78778, J.O. # 6422243. 
Ad Paid by an Equal Opportunity Employer. 


SUNY New Paltz 

The Department of Mathematics and Com¬ 
puter Science invites applications for two 
Computer Science tenure-track positions, 
one junior, one senior, starting Fall, 1992. 
The junior position requires a Ph.D. in Com¬ 
puter Science. The senior position requires 
an established record of scholarship in Com¬ 
puter Science and a Ph.D. in Computer Sci¬ 
ence or a related field. 

SUNY New Paltz is in the Hudson Valley/ 
Catskill region of New York with easy access 
to NYC. The College offers a BS and MS in 
Computer Science and a BS in Electrical 
Engineering, and enjoys a close relationship 
with nearby computer industry. 

Send resume and 3 letters of recommen¬ 
dation to A. J. Dos Reis, Department of 
Mathematics and Computer Science, Box 
10, SUNY New Paltz, New Paltz, N.Y. 
12561. Closing date for application is March 
16, 1992. SUNY is an affirmative action/ 
equal opportunity employer. 


WINONA STATE UNIVERSITY 

Department of Computer Science 

The Department of Computer Science in¬ 
vites applications for a tenure-track faculty 
position. Rank/Title and Salary will be based 
on qualification and experience. A Ph.D. in 
Computer Science or closely related field 
must be completed by start of appointment. 
Duties include teaching full range of Com¬ 
puter Science courses based on ACM 
guidelines and assignments at the WSU- 
Rochester Center. A commitment to ex¬ 
cellence in teaching is essential. Other ac¬ 
tivities include participation in departmental 
and university committees, student advising, 
and participation in CS projects and 
research. 

Winona State University is located in the 
beautiful city of Winona in Southeastern 
Minnesota. The mission of the University is 
to serve the broad educational needs of the 
people of the region and to provide high- 
quality programs. 

The Department of Computer Science 
has eleven full-time faculty members and of¬ 
fers strong programs in Computer Science, 
CIS and MIS. 

The department takes pride in its col- 
legiality and relations with area industry. 

Qualified applicants should submit a letter 
of interest, detailed resume, copy of tran¬ 
scripts (with official transcripts to follow im¬ 
mediately). list of three or more references, 
with addresses and phone numbers (Refer¬ 
ence letters preferred) to: Computer Science 
Search, Affirmative Action Office, Winona 
State University, Winona, MN 55987. Initial 
screening will begin February 1, 1992, and 
will continue until the position is filled. 
AA/EEO. Women, minorities, and disabled 
individuals are encouraged to apply. 


NORTHEAST 
LOUISIANA UNIVERSITY 

Applications are invited for a tenure track 
assistant or associate professor in a CSAB 
accredited computer science program. Can¬ 
didates should have a Ph.D. in computer sci¬ 
ence preferably with emphasis in software 
engineering and a strong commitment to 
teaching. Teaching load is 6-9 hours with 
competitive salary and twenty year retire¬ 
ment program. Applications will be accepted 
until position is filled. Applicants should send 
a resume, transcripts, and three references 
to Dr. Kris Cooper, Chair. Search Commit¬ 
tee, Department of Computer Science, 
Monroe, LA 71209-0575. Phone (318) 
342-1856. NLU is an Equal Opportunity Af¬ 
firmative Action Employer. 


ACADEMIA SINICA 
TAIWAN, REPUBLIC OF CHINA 
Institute of Information Science 

Applications are invited for research posi¬ 
tion in Institute of Information Science. 
Academia Sinica, Ph.D. in Computer Sci¬ 
ence or closely related fields required. 
Demonstratable research ability necessary. 
Applicants for senior positions must have 
proven research record. All fields in Com¬ 
puter Science are welcome. 

The Institute offers a good research envi¬ 
ronment. No duty of teaching. Facilities 
include a 32-node NCUBE.2 parallel super¬ 
computer, many SUN, SGI, and E&S work¬ 
stations. An easily accessible ETA-10Q 
supercomputer is in the Academia Sinica. 

Interested people please send application 
to Dr. Lin-Shan Lee, Director, Institute of In¬ 
formation Science, Academia Sinica, Tai¬ 
pei, Taiwan, 11529, Republic of China. 
Fax: (011-886-2) 782-4814. 


ARKANSAS STATE UNIVERSITY 
Faculty Position in Computer Science 

The Department of Computer Science, 
Mathematics and Physics invites applications 
for a tenure track position beginning August 
15, 1992. The department offers the B.S. 
andM.S. degrees in Computer Science. The 
Doctorate or ABD status in computer sci¬ 
ence is required. Duties Include teaching and 
research. Provide a current resume, three 
current letters of reference, and transcripts 
(copies acceptable). Screening will begin 
February 15, 1992 and continue until the 
position is filled. Submit applications to Dr. 
Robert P. Smith, Chair of the Search Com¬ 
mittee, P.O. Box 70. State University, AR 
72467. ASU is an Equal Opportunity/ 
Affirmative Action employer. 
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D.E. SHAW & CO. 

Algorithmic Trading 

D.E. Shaw & Co. is a small (several dozen 
employees), highly capitalized (over $300 
million in partners' equity), extremely suc¬ 
cessful Wall Street firm specializing in quan¬ 
titative finance and computational trading. 
Computer scientists and system designers 
form the professional core of the firm, and 
not merely its support staff, and participate in 
its profits. Our technical staff includes both 
Ph.D.-level researchers recruited from Stan¬ 
ford, MIT, and other leading computer sci¬ 
ence departments and extraordinarily talented 
B.S.-and M.S.-level system designers and 
“superhackers”. It is our practice to compen¬ 
sate unusually gifted individuals at a level ex¬ 
ceeding that of the market. Applicants should 
send resumes to Jennifer Strulson, D.E. 
Shaw & Co., 39th Floor, Tower 45, 120 W. 
45th St.. New York, NY 10036. 


NATIONAL CHIAO TUNG 
UNIVERSITY 

Taiwan, Republic of China 

Applications are invited for faculty posi¬ 
tions in both Department of Computer and 
Information Science and the Department of 
Computer Science and Information Engi¬ 
neering beginning in August 1992. A Ph.D. 
in Computer or Information Science/Engi¬ 
neering or closely related fields is required. 
The University 

National Chiao Tung University is located 
in Hsinchu City near the Science and In¬ 
dustrial Park in Taiwan. It is famous for its 
programs in the areas of Co'mputer Science, 
Computer Engineering, and Electrical Engi¬ 
neering. There are two computer related 
departments in the university, the Depart¬ 
ment of Computer and Information Science 
in Science College and Department of Com¬ 
puter Science and Information Engineering 
in Engineering College. The University 
Computer Center has CONVEX C240 and 
C220 minisupercomputers which are ac¬ 
cessible through FDDI campus network from 
all departments. The Center is the regional 
center of TANet which connects universities 
and research institutes in Taiwan. TANet 
connects to Internet so researchers in Taiwan 
can easily communicate with people in the 
foreign countries. 

Department of Computer and 
Information Science 

The department provides an excellent 
research and teaching environment. Depart¬ 
ment facilities include 20 SUN workstations, 
6 SGI graphics workstations (including one 
VGX340). 15 IBM RS-6000, some SONY 
and DECstations, one 16 nodes nCUBE 
parallel computer, one 16 nodes Trans¬ 
puter, two CONVEX C120 minisupercom¬ 
puters, and many PCs. All computers in the 
Department are interconnected via Ether¬ 
net. The department offers BS, MS, and 
Ph.D. programs. Applicants should have 
strong commitment to teaching graduate 
and undergraduate courses and performing 
research. All areas in Computer Science and 
Computer Engineering are welcome. 

Interested people please send resume and 


names of three references to Professor Wei- 
Pang Yang. Head. Department of Com¬ 
puter and Information Science. National 
Chiao Tung University, Hsinchu, Taiwan, 
Republic of China by the end of February 
1992. 

FAX: (011-886-35) 721-490. 
Department of Computer Science and 
Information Engineering 

The department offers BS, MS, and 
Ph.D. programs. Theory and implementa¬ 
tion of computing systems are both em¬ 
phasized in the department. General pur¬ 
pose computing is provided by a VAX8530, 
two VAX780s, 30 SUN workstations, 10 
IBM RS-6000s, 30 MACs, some HPs, and 
over 150 PCs. Special purpose computing 
facilities include one Sequent S27, three 
transputer networks, one hypercube and 
some IRIS workstations. All computers are 
connected via Ethernet. The department 
also owns the largest nationwide digital 
system laboratory. Special equipment and 
packages for VLSI and digital system design, 
simulation and performance analysis, image 
processing and vision analysis, and software 
engineering are available. Duties of faculty 
members will include teaching undergradu¬ 
ate and graduate courses and performing 
research. The department will give special 
support to new faculty members for starting 
their research. 

The department is particularly interested 
in individuals conducting research in com¬ 
puter system, compiler, computation theory 
and algorithms, DBMS, Multi-media infor¬ 
mation system, and parallel architecture. 
However, qualified candidates in all areas 
will be considered. Interested applicants are 
invited to send a resume and three refer¬ 
ences to Professor Suh-Yin Lee, Head, De¬ 
partment of Computer Science and Informa¬ 
tion Engineering, National Chiao Tung 
University, Hsinchu, Taiwan 300, Republic 
of China by the end of February, 1992. 

FAX: (011-886-35) 724-176. 


COMPUTER SPECIALIST 
$30,000/yr. 

Duties: Installation, configuration and' 
administration of an Intel ipsc-860 hyper¬ 
cube: system software installation and con¬ 
figuration, routine preventative mainte¬ 
nance. hardware and software checks and 
system backup and restoration on platforms 
such as Sun Sparc servers, Sun Sparc sta¬ 
tions, IBM RISC/6000 and HP 700; net¬ 
work configuration and installation, such as 
E Mail. NFS, NIS, and DNS; and installation 
and configuration of peripheral units such as 
printers, plotters and tape drives, along with 
application software installation and con¬ 
figuration . 

Requirements: M S. in Computer Sci¬ 
ence with a GPA of 3.7/4.0 or above; and 
1 year experience in computer programming 
involving use of C computer languages and 
UNIX operating system. 

Job Site: Dallas, Texas 

Apply at the Texas Employment Commis¬ 
sion, Dallas, Texas, or send resume to the 
Texas Employment Commission, TEC Build¬ 
ing, Austin. Texas 78778, J.O. # 6422276. 
Ad Paid by an Equal Employment Oppor¬ 
tunity Employer. 


SOFTWARE ENGINEER/ 
SYSTEMS ANALYST 

Software Engineer/Systems Analyst; 
Responsible for customized software devel¬ 
opment in the chemical, gas, and oil fields. 
Software will be developed to run on LAN, 
using various tools/languages/DOS. Win¬ 
dows 3.0. Applications will be of a scientific 
nature using dbase systems. The developed 
software graded toward efficient/safe opera¬ 
tion and maintenance of critical equipment 
at chemical plant. Constant self-actualization 
and research required to develop these sys¬ 
tems with latest most appropriate techni¬ 
ques. Must have MS in Computer Science or 
Systems Science; no training required: Must 
have 1 year of experience in job offered; 
which must have included knowledge of de¬ 
signing dbase systems, graphic processing, 
object-oriented languages and environment, 
AI. data structures and design of algorithms 
and related mathematical theories along with 
use of Clipper, DOS, C, WINDOW-based 
systems, DBaselH and graphic interfaces; 
$36,000/yr.; 40 hour work week; 7:45 am - 
4:15 pm (30 minutes unpaid lunch). Con¬ 
tact Louisiana office of Employment Securi¬ 
ty. Job #099062, 1991 Wooddale Blvd., 
Baton Rouge, LA 70806. 


SOUTHWEST OHIO 
MCAE COMPANY 

Technical Development Engineer, 
Geometry Interfaces 

Duties: Create commercial quality soft¬ 
ware applications that provide customers the 
ability to exchange mechanical product 
design information with cooperative and 
competing design products. Project respon¬ 
sibilities include data structure and data base 
design, software module design and imple¬ 
mentation, unit and system testing, mainte¬ 
nance and quality assurance planning. 

Bachelor of Science degree in Computer 
Science required with at least an under¬ 
graduate minor in Mathematics. Course- 
work must include at least two semester or 
equivalent quarter hours in calculus and/or 
analytic geometry, at least three semester 
hours or equivalent quarter hours in each of 
the following: a) Data Structures, b) Fortran 
Programming and c) C Programming. 

No experience required in above duties, 
but applicants will qualify with two years’ ex¬ 
perience in the- development of technically 
oriented commercial software products for a 
large commercial enterprise. Employer will 
accept in place of one year of this experi¬ 
ence, a Master’s degree in computer science. 
The applicant must have experience in an 
academic (at least three courses or six month 
research) or commercial (at least six months) 
environment in the use of DEC VAX/VMS. 
Forty hrs./wk., 8 a.m.-5 p.m., Mon.-Fri.; 
Salary: $36,120 (exempt). 

Must have proof of legal authority to work 
permanently in the U.S. Send resume in 
duplicate (no calls) to G. Maugans, JO 
#1230808, Ohio Bureau of Employment 
Services, P.O. Box 1618, Columbus, Ohio 
43216. 
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MICHIGAN STATE UNIVERSITY 

Department of Computer Science 

The Department of Computer Science in¬ 
vites applications for anticipated tenure- 
stream positions at the assistant and asso¬ 
ciate levels, rank and salary commensurate 
with experience. Candidates from all areas 
of specialization in computer science or com¬ 
puter engineering will be considered; how¬ 
ever. the department has a special interest in 
candidates in the areas of software engineer¬ 
ing, computer vision, and scientific visualiza¬ 
tion who also are interested in parallel com¬ 
puting. The research foci of the department 
include artificial intelligence and knowledge- 
based systems, computer architecture and 
design automation, parallel and distributed 
computing systems and algorithms, pattern 
recognition and image prpc^ssing. software 
systems, and theoretical computer science. 
Candidates should have a Ph.D. in com¬ 
puter science or computer engineering and 
have a strong interest in both research and 
teaching. The appointments will begin in 
August, 1992. For full consideration, ap¬ 
plications should be submitted by February 
1. 1992. However, applications will be ac¬ 
cepted until the positions are filled. 

As a unit within the College of Engineering 
at Michigan State University, Computer Sci¬ 
ence offers the Bachelor of Science, Master 
of Science and Doctor of Philosophy degrees. 
The department currently has 26 tenure- 
stream faculty, and an enrollment of 180 
graduate students and 480 undergraduates. 
Special support is available from within the 
college and university to initiate research by 
new faculty members. Faculty offices are 
connected to the MSUnet which provides 
access to an array of campus computing re¬ 
sources including the facilities of the College 
of Engineering, the Department's Pattern 
Recognition and Image Processing Labora¬ 
tory, Artificial Intelligence/Knowledge Based 
Systems Laboratory, and the Advanced 
Computing Systems Laboratory. Additional 
facilities available to faculty include an IBM 
3090/180E vector processor, a 4 processor 
Convex C-240, a 96-node BBN GP-1000, 
45-node BBN TC-2000, and a 64-node 
nCUBE. Access to select off-campus com¬ 
puters is available. 

Michigan State University enjoys a park¬ 
like campus of 2,100 developed acres and 
3,100 acres of experimental farms, outlying 
research facilities and natural areas. The 
campus is adjacent to the cities of East Lan¬ 
sing and the capital city, Lansing. The 
Greater Lansing area has approximately 
250,000 residents. The communities have 
fine school systems and place a high value 
on education. 

Applicants should send a letter of intent, 
resume, the names of three references, and 
a statement of research and teaching in- 

Dr. Anthony S. Wojcik. Chairperson 

Department of Computer Science 
A714 Wells Hall 
Michigan State University 
East Lansing. Michigan 48824-1027 
wojcik@cps.msu.edu 
Michigan State University is an Equal Op¬ 
portunity/Affirmative Action Institution and 
encourages applications from women and 
members of ethnic minority groups. Position 
Numbers: ENG 158 and ENG 199. 


DATA BASE 

ANALYST / PROGRAMMER 

Data Base Analyst/Programmer needed 
to design, develop, and implement new 
systems software utilizing computer pro¬ 
gramming languages such as RPG400, 
SYNON, and AS/400 Control Language. 
Develop, implement and maintain the inter¬ 
face between AS/400 and ZEBRA-130, 
SATO M-8400 barcode machine and 
COLOG time attendance machine. Docu¬ 
ment technical design specifications and pro¬ 
prietary software requirements including 
hierarchal and relational database accessing 
and retrieval. Conduct special studies and in¬ 
vestigations pertaining to development of 
new information systems to meet current 
and project development of new information 
systems to meet current and projected 
needs. Requires a Bachelor's degree in 
Computer Science/Systems Analysis and 
three years experience in job offered or three 
years directly related experience in Rela¬ 
tional Database and Normalization, 4th 
Generation Language, SYNON and PC. Or 
will consider a Master's degree in Computer 
Science/Systems Analysis or its equivalent 
and one year experience in job offered or 
one year experience in Relational DataBase 
and Normalization, 4th Generation Lan¬ 
guage, SYNON and PC in lieu of the Bache¬ 
lor and three years. 40 hour work week. 
$31,000.00 per year. Apply at the Texas 
Employment Commission, Dallas, Texas, or 
send resume to the Texas Employment 
Commission, TEC Building, Austin, Texas 
78778, Job Order #6344583. Ad Paid by an 
Equal Opportunity Employer. 


UNIVERSITY OF NEVADA 
Computer Science Chair 

The Howard R. Hughes College of Engi¬ 
neering as the University of Nevada, Las 
Vegas invites applicants for the position of 
Chair. Department of Computer Science. In 
addition to administrative and teaching 
duties, the Chair will assist in the develop¬ 
ment of funded research opportunities and 
direct the initiation and development of the 
recently approved Ph D. program in Com¬ 
puter Science. Applicants must have a 
Ph.D. in Computer Science, or a closely 
related area, and a distinguished record in 
teaching, research, and service. The appli¬ 
cant must have experience in advising and 
directing doctoral students and be familiar 
with the academic requirements of B.S., 
M.S., and Ph.D programs in Computer Sci¬ 
ence. Administrative experience at the 
Department or College level is desirable. 

UNLV is a growing university with over 
19.500 students and rapidly expanding 
academic and research programs of state 
and national significance. Engineering and 
computer science programs are housed in the 
new $15,000,000 Thomas T. Beam Engi¬ 
neering Complex. AH programs have a 
strong research base with steadily growing 
undergraduate and graduate enrollments. 
Computer Science has ten faculty. 220 
undergraduate and 45 graduate students. 
Faculty are currently active in several spon¬ 


sored research areas including: optical 
character recognition; document storage 
and retrieval; computational geometry 
algorithms; computer graphics applications; 
and simulation. 

The department has modern, well equip¬ 
ped laboratory facilities including: SUN 
SPARC, NeXT, Silicon Graphics 4D, and 
DEC workstations, a Symbolics LISP 
machine, and Intel Hypercube, transputers, 
and an Ethernet Local Area Network. In ad¬ 
dition, faculty have access to a CRAY 
Y-MP2 in the National Supercomputing 
Center for Energy and the Environment 
operated by the College. 

The appointment will begin August 17, 
1992. Applicants should submit a letter of 
application, a complete vita, and the names, 
addresses and telephone numbers of four 
references. Review of applications will begin 
February 1, 1992 and continue until the 
position is filled. Please send applications to: 
Dr. Walter C. Vodrazka, Chair, Computer 
Science Search Committee, Howard R. 
Hughes College of Engineering, University 
of Nevada, Las Vegas, 4505 Maryland Park¬ 
way. Las Vegas, NV 89154. (702) 739-3699. 
E-Mail address: dodgers@UNLV.edu. 
Salary is competitive. 

UNLV is an Equal Opportunity, Affirma¬ 
tive Action Employer and encourages 
women and minorities to apply. UNLV em¬ 
ploys only U.S. citizens and aliens author¬ 
ized to work in the U.S. 


OPERATIONS RESEARCH ANALYST 
$34,860/yr. 

Duties: Provide analytical and technical 
studies and support to the Planning Group, 
including: develop, analyze, evaluate, 
modify and recommend cost estimate and 
scheduling systems, funding plans and 
schedules for projects by using the Critical 
Path Method and Program Evaluation and 
Review Techniques and in association with 
scientists, engineers and management per¬ 
sonnel; participate in the preparation of 
reports for the Department of Energy to 
evaluate the progress and funding profile of 
the project by integrating internal and exter¬ 
nal publications, personnel list and funding 
request spreadsheets based on information 
provided by different groups in the Lab; and 
conduct analyses on technical apparatus 
production and installation by computer 
simulation models. 

Requirements: B.S. in Industrial Engi- 
neering and Operations Research; 1 year 
college-level training in Business Administra¬ 
tion; and 1 year experience in job offered or 
1 year experience in working as Computer 
Information System Programmer, involving 
development of cost and scheduling plans 
relating to development and operation of 
accelerator facilities, and using APL and 
UNDO computer languages and PC-DOS, 
Macintosh, UNIX and VMS programming 
environment. 

Job Site: Dallas, Texas 

Apply at the Texas Employment Commis¬ 
sion, Dallas, Texas, or send resume to the 
Texas Employment Commission, TEC Build¬ 
ing, Austin, Texas 78778, J.O. #6422246. 
Ad Paid by an Equal Opportunity Employer. 
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NAVAL POSTGRADUATE SCHOOL 
Chair of Department 

The Computer Science Department of the 
Naval Postgraduate School seeks a Depart¬ 
mental Chair, with term beginning no later 
than 30 September 1992. The ideal candi¬ 
date holds a Ph.D. in Computer Science, is 
from an academic environment, and main¬ 
tains a continuing, strong involvement in re¬ 
search. We desire a candidate who facilitates 
the activities of departmental faculty and 
exerts leadership in developing department 
and interdisciplinary programs. The candi¬ 
date should possess strong interpersonal and 
written communication skills. The candidate 
must be an excellent teacher. 

The department offers graduate degrees 
with specialization tracks in artificial in¬ 
telligence and robotics, computer graphics 
and visual simulation, computer systems and 
architectures, database and data engineer¬ 
ing, and software engineering. The depart¬ 
ment currently has major applied research 
funding in software engineering, computer 
graphics/virtual reality, and artificial in¬ 
telligence/robotics. We believe that our 
future departmental growth areas will be in 
the areas of software engineering/real-time 
systems, computer graphics/virtual reality, 
and secure distributed systems. Conse¬ 
quently, we prefer a chair whose research 
area matches with one of our areas of growth 
though we will consider exceptional candi¬ 
dates from any area. U.S. citizenship is re¬ 
quired. Applicants interested in the position 
should forward their resume to: 

Associate Professor Michael J. Zyda 
Chair, Chair Search Committee 
Naval Postgraduate School 
Code CS/Zk, Department of Computer 

Monterey, California 93943-5100 
(408) 646-2305 

Email: zyda@trouble.cs.nps.navy.mil 
The Naval Postgraduate School is an 
Equal Opportunity/Affirmative Action 
Employer. 


DEVELOPMENT STAFF MEMBER 

Conduct research on the design of algo¬ 
rithms, particularly related to routing and 
network location for distributed systems. 
Provide design, development, and testing of 
communication protocols and utilities for 
heterogenous networking. Provide technical 
leadership for the design and development 
of network operating systems. Job respon¬ 
sibilities include architecture presentations, 
technical education to team members, and 
advanced end-user interfaces based on com¬ 
puter graphics. 40 hours/week, $61,000/ 
year. Must have Ph.D. in Computer Sci¬ 
ence, 1 year experience on the job or 1 year 
experience as a pre- or post doctoral re¬ 
search assistant/or systems programmer. 
The 1 year related experience must include 
design and development of communications 
protocols for distributed computing, Net¬ 
work Operating Systems, database applica¬ 
tions, microprocessor simulation, computer 
graphics systems, and adapting DECNET 
networks. Must have knowledge of distri¬ 
buted algorithms as evidenced by course 
transcript or letter of employment. Apply at 


the Texas Employment Commission, Austin, 
Texas, or send resume to the Texas Employ¬ 
ment Commission, TEC Building, Austin, 
Texas 78778, J.O.*: 6449022. Ad paid by 
an equal employment opportunity employer. 


UNIVERSITY OF WASHINGTON 

Department of Computer Science 
and Engineering 

The Department of Computer Science 
and Engineering at the University of 
Washington expects to have one or more 
tenure-track openings starting in the 
1992-93 academic year. We seek outstand¬ 
ing applicants who add to our existing 
research strengths, particularly in compilers, 
computer systems engineering, and software 
engineering, or who bring significant new 
research strength to our department. 

A moderate teaching load allows time for 
quality research and close involvement with 
students. We expect applicants to have a 
strong commitment to both research and 
teaching, and an outstanding record of 
research for their level. 

Interested applicants should send a letter 
of application, a resume, and the names of 


four references to Faculty Recruiting Com¬ 
mittee, Department of Computer Science 
and Engineering FR-35, University of Wash¬ 
ington, Seattle, Washington 98195. Candi¬ 
dates are encouraged to apply as early as 
possible. 

The University of Washington is an Affir¬ 
mative Action/Equal Opportunity Employer. 
The Ph.D. is required for these positions. 


UNIVERSITY OF UTAH 

Department of Computer Science 

Applications are invited for one tenure- 
track position in Computer Science, avail¬ 
able in July 1992. The position can be either 
in the area of Systems or in Computer 
Graphics. All applicants should have a Ph.D. 
in Computer Science. A strong commitment 
to research and teaching is required. Send 
Curriculum Vita and names of at least three 
references to: Faculty Recruiting Committee 
c/o Shawn Darby, Department of Computer 
Science, 3190 MEB, University of Utah, Salt 
Lake City, Utah 84112. The University of 
Utah is an Equal Opportunity, Affirmative 
Action Employer and encourages nomination 
and application from women and minorities. 


Director 

Center for Computational Sciences 
Oak Ridge National Laboratory 


Oak Ridge National Laboratory 
(ORNL) has recently established a 
Center for Computational Sciences 
(CCS) to conduct grand challenge- 
scale research, to manage and oper¬ 
ate the required high performance 
computing systems, and to support 
educational initiatives. ORNL is 
seeking an individual of national 
stature to direct this Center. As the 
initial Director, this individual with 
play a significant role in defining 
the CCS. The Director will report to 
an ORNL Associate Director. 
Primary responsibilities: 

• Provide leadership in computa¬ 
tional science on a national level 

• Provide overall management 
and/or coordination of the activi¬ 
ties under the Partnership in Com¬ 
putational Science, consisting of 
several national laboratories, uni¬ 
versities, and industrial affiliates 

• Serve as principal contact for CCS 
activities related to the Federal 
High Performance Computing 
and Communications Program 


Desirable qualifications: 

• Scientist with PhD or equivalent 
with demonstrated expertise in 
solving large-scale computational 
science problems 

• Senior level manager with proven 
ability to lead and manage a 
diverse, multidisciplinary research 
activity in computational science 

• Knowledge of federal and indus¬ 
trial programs, initiatives, and 
policies 

A search committee for this posi¬ 
tion will be established to identify 
appropriate candidates and review 
applications. Applications should 
be sent to: 


Dr. Bill R. Appleton, Oak Ridge 
National Laboratory, P.O. Box 2008 
Oak Ridge, TN 37831-6240 



February 1992 











UNIVERSITY OF ALBERTA 

Department of Computing Science 

Applications are invited for a tenure-track 
position at the Assistant Professor level in the 
areas of computer graphics and/or human 
interfaces. Excellent senior applicants will 
also be considered in these areas. Responsi¬ 
bilities include research as well as teaching at 
both the graduate and undergraduate levels. 

This young and energetic department has 
recently undergone significant expansion 
and consists of 35 academic and 29 support 
staff. Current hardware support for research 
includes a local area network consisting of a 
MIPS M/120 and Silicon Graphics 4D/340S 
computer connecting over 75 Sun Worksta¬ 
tions and a large number of terminals. Cur¬ 
rent computer graphics facilities include two 
Silicon Graphics workstations^ Data Glove, 
a head-mounted display, a single frame 
video recorder and two colour sun worksta¬ 
tions. The department also operates a Myrias 
SPS-2 massively-parallel computer as a re¬ 
search tool. Well-supported laboratories also 
exist for AI, database, distributed systems, 
image processing, programming languages 
and robotics research. 

The current salary range minimum is 
$C38,955 with the appointment level being 
commensurate with qualifications and ex¬ 
perience. Send curriculum vitae, the names 
of three references and up to three reprints 
or copies of important publications. New 
Ph.D.'s should include a copy of their 
transcript. 

Apply to: 

Dr. Paul G. Sorenson 
Department of Computing Science 
University of Alberta 
Edmonton, Alberta, Canada 
T6G 2HI 

Applications will be accepted until Febru¬ 
ary 29. 1992. 

In accordance with Canadian Immigration 
requirements, priority will be given to Cana¬ 
dian citizens and permanent residents of 
Canada. 

The University of Alberta is committed to 
the principle of equity in employment. The 
University encourages applications from 
aboriginal persons, disabled persons, mem¬ 
bers of visible minorities, and women. 


SENIOR RESEARCH SCIENTIST 

Research and development in the areas of 
neural networks, time series prediction, and 
non-linear system identification. Develop 
neural network-based time series analysis 
and prediction models. Develop new neural 
network architectures and learning algo¬ 
rithms. 40 hours per week. 9:00 am to 6:00 
pm: $5,000 per month. Requires Ph.D. in 
Electrical Engineering with major in neural 
networks, and 6 months experience in the 
job offered, or as a Research Associate, and 
a minimum of 1 published research paper in 
each neural networks and system identifica¬ 
tion. Apply at Texas Employment Commis¬ 
sion. Houston. Texas or send resume to 
Texas Employment Commission. TEC Build¬ 
ing. Austin, Texas 78778, J.O. #6521516. 
Ad paid by an Equal Opportunity Employer. 


UNIVERSITY OF CENTRAL FLORIDA 
Computer Engineering 
Faculty Position in Digital 
Systems/Computer Architecture 

The University of Central Florida College 
of Engineering invites applicants for a 
tenure-track Assistant Professor position in 
its Department of Computer Engineering 
beginning with the Fall term of 1992. A 
Ph.D. in Computer Engineering or a related 
discipline is required. UCF is a member of 
the State University System of Florida and 
has a current enrollment in excess of 22,000 
students. The University is developing a 
Research Park adjacent to its main campus 
to support high-technology government and 
industrial activity in the Central Florida area. 
Computer facilities for instruction and re¬ 
search include a Sun Workstation Network, 
an NCUBE Parallel Machine and numerous 
mini and micro computers. An Ethernet 
LAN links College of Engineering offices and 
labs together. 

Send Curriculum Vita and the names of 
three references postmarked by March 15, 
1992. to: 

Dr. C.S. Bauer, Chair 
Department of Computer Engineering 
University of Central Florida 
P.O. Box 25,000 
Orlando, FL 32816 
Phone: (407) 823-2236 
FAX 407-823-5483 
E-mail: csb@engr.ucf.edu 
The University of Central Florida is an 
Equal Opportunity/Affirmative Action em¬ 
ployer. As an agency of the State of Florida, 
the University makes all application materials 
and selection procedures available for public 


UNIVERSITY OF SOUTHWESTERN 
LOUISIANA 
Position Announcement 
Head of Computer Science and 
Associate Director of the Center for 
Advanced Computer Studies 

Applications are invited for the joint posi¬ 
tion of Department Head of Computer Sci¬ 
ence and Associate Director of the Center for 
Advanced Computer Studies. Computer Sci¬ 
ence has been a flagship program at USL 
since the late 1960s. The formation of the 
Center for Advanced Computer Studies at 
USL in the mid-80s extended the program to 
include an international research mission. 
The University enrollment for the Fall 1991 
semester totaled approximately 16,000. 

DESCRIPTION OF THE POSITION: The 
Department of Computer Science is an 
academic unit within the College of Sci¬ 
ences. with five full-time faculty positions, 
seven one-half time instructor positions and 
approximately 400 majors. This Department 
is responsible for administering an excellent 
undergraduate curriculum that has been ac¬ 
credited since 1986. The Center for Ad¬ 
vanced Computer Studies is a separate aca¬ 
demic unit with approximately 200 graduate 
students and 25 graduate faculty. The posi¬ 
tion to be filled reports jointly to the Dean of 
the College of Sciences and the Director of 
the Center for Advanced Computer Studies. 


The incumbent is in a unique position to 
direct an excellent undergraduate computer 
science program and to play an active role in 
the graduate and research programs of the 
Center for Advanced Computer Studies. 
Computer science and engineering labora¬ 
tory facilities include LANs that connect over 
120 Sun workstations, several file servers, 
and an assortment of special laboratories for 
parallel processing, robotics, and VLSI 
design. This position will provide outstand¬ 
ing professional opportunities and a com¬ 
petitive salary. 

QUALIFICATIONS: 

(1) a Ph.D. in computer science or a related 
field: 

(2) credentials to qualify for rank of associate 
or full professor: 

(3) a proven commitment to excellent 
undergraduate and graduate education 
and research; 

(4) proven strong leadership; 

(5) evidence of administrative ability. 
APPLICATION CLOSING DATE: Appli¬ 
cants should send a letter of application con¬ 
taining statements of their academic and ad¬ 
ministrative philosophies; a resume, and 
names of at least three references. For initial 
consideration, applications should be re¬ 
ceived by March 6, 1992. 

Please send applications to: 

Dr. Steve Landry, Chair 
CMPS/CACS Search Committee 
P.O. Box 43610 
Lafayette, LA 70504 
spl@usl.edu 


MEMBER OF TECHNICAL STAFF 

Apply theories of distributed, concurrent 
and real-time computation, including tem¬ 
poral logic, to the specification of the soft¬ 
ware component of large complex systems 
that are distributed in nature. Develop 
methodologies for the computer assisted 
synthesis of high level designs from these 
specifications via temporal logic decision 
procedures and model checking. Develop 
methodologies for the computer assisted 
translation of these high level designs into 
low level architecture specific designs via 
algebraic transformations. Investigate the 
utility of the notion of fairness in the develop¬ 
ment of these methodologies. Participate in 
the implementation of software tools based 
upon the Raddle model of computation. 
M.S. or equivalent in Computer Science re¬ 
quired. One year experience in job offered 
or in research related to semantics of concur¬ 
rent languages required. Must have success¬ 
fully completed minimum of one graduate- 
level course in each of the following: mathe¬ 
matical logic: formal semantics and verifica¬ 
tion; communicating sequential processes; 
logics of programs; theory of predicate 
transformer semantics: automata and formal 
languages; distributed systems: distributed 
computing. Hours are 8:00 a.m. to 5:00 
p.m., 40 hours weekly, at an annual salary 
of $59,200. Apply at the Texas Employ¬ 
ment Commission. Austin, Texas, or send 
resume to the Texas Employment Commis¬ 
sion, TEC Building, Austin, Texas 78778, 
J.O. #6599101. Ad Paid by an Equal Op¬ 
portunity Employer. 
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COMPUTER COMMUNICATION 
SPECIALIST 
$37,500/yr. 

Duties: Build an object-oriented interac¬ 
tive graphical user environment for simulat¬ 
ing super collider, a system which consists of 
(a) a pictorial static representation of lattice 
elements of the entire SSC Accelerator 
Complex; (b) an object-oriented graphical 
representation of static simulation including 
plotting of lattice optics functions; and (c) 
dynamic simulation. 

Requirements: M S. degree in Compu- 
ter Science Engineering; and 2 years experi¬ 
ence in job offered or 2 years experience in 
object-oriented programming and CAD en¬ 
vironment, using C + +, Interview, Gnu 
g+ +, C, PASCAL, VHDL, Transact SQL, 
and FORTRAN languages; UNIX, VMS and 
SUNOS operating systems; and SYBASE 
relational management system, or 2 years 
college-level training in object-oriented 
programming. 

Job Site: Dallas, Texas 

Apply at the Texas Employment Commis¬ 
sion, Dallas, Texas, or send resume to the 
Texas Employment Commission, TEC Build¬ 
ing, Austin, Texas 78778, J.O. *6422247. 
Ad Paid by an Equal Opportunity Employer. 


FACULTY FOR EUROPE AND ASIA 

Planning a sabbatical or leave of absence? 
The University of Maryland University 
College seeks excellent lecturers for under¬ 
graduate computer science, computer appli¬ 
cations, and information systems manage¬ 
ment courses on U.S. military bases in 
Europe and in Asia and the Pacific. Renew¬ 
able annual appointments begin August 
1992. Minimum requirements include a 
master's degree in computer science or a 
related field, recent college teaching ex¬ 
perience, and U.S. citizenship. Benefits in¬ 
clude transportation and military base 
privileges. Frequent travel and the cost of 
schooling make these positions difficult for 
those with children. Send resume to: Dr. 
Ralph E. Millis. Overseas Programs, The 
University of Maryland University College, 
College Park. MD 20742-1642. AA/EEO. ' 


THE CENTER FOR ADVANCED 
COMPUTER STUDIES 
University of Southwestern Louisiana 
Faculty Positions 
Graduate Fellowships 

Candidates with a strong research record 
and earned doctorate in Computer Science 
or Computer Engineering are invited to apply 
for a tenure track senior position available 
starting in Fall 1992. One appointment is 
planned for the rank of Professor. Con¬ 
sideration will also be given to exceptional 
candidates at the Assistant Professor rank. 
Candidates for the senior position must have 
established publications and grant creden¬ 
tials. Consideration will be given to all 
outstanding applicants, but preferred areas 


of interest are software engineering, com¬ 
puter networks, operating systems, data¬ 
bases, computer architecture, artificial in¬ 
telligence, and theoretical computer science. 

Our typical teaching load is two graduate- 
level courses per year and a continuing 
research seminar. Substantial State and Uni¬ 
versity funds are available to support re¬ 
search initiation efforts. Salaries are com¬ 
petitive along with excellent support directed 
towards attainment of faculty professional 
goals. Our colloquium series brings typically 
eight world known professionals to our 
campus each year. 

A number of Ph.D. Fellowships valued at 
up to $18,000 per year including tuition and 
fees are available. They provide support for 
up to four years of study towards the Ph.D. 
in Computer Science or Computer Engi¬ 
neering. Eligible candidates must be U.S. 
citizens or hold an earned MS degree from a 
U.S. or Canadian University. Recipients also 
receive preference for low-cost campus 
housing. 

The Center is a graduate research center 
of 36 faculty and staff with programs leading 
to MS/Ph.D. degrees in Computer Science 
and Computer Engineering. The Center is 
located in Acadiana about 100 miles west of 
New Orleans. The Center has a network of 
70 Sun workstations dedicated for instruc¬ 
tional and research purposes. It also has the 
following five specialized laboratories equip¬ 
ped with state-of-the-art computing facilities: 
VLSI Research; Computer Vision and Pat¬ 
tern Recognition; Intelligent Robotics 
Systems; Software Research; and Artificial 
Intelligence. More than 220 students are 
enrolled in computing graduate programs, 
including over 100 Ph.D. students. The 
undergraduate program in the Computer 
Science Department is accredited by CSAB 
and offers both scientific and commercial op¬ 
tions, with a current enrollment of 280. The 
undergraduate program in the Electrical and 
Computer Engineering Department is ac¬ 
credited by ABET, and offers an option in 
Computer Engineering, with a current en¬ 
rollment of 155. 

To apply, send a copy of your resume and 
the names and addresses of at least three 
professional references to: Dr. Michael C. 
Mulder, The Center for Advanced Compu¬ 
ter Studies, USL, Lafayette, LA 70504-4330; 
Telephone: (318) 231-6284. Applications 
will be considered until all positions are filled. 

An Affirmative Action/Equal Opportunity 
Employer. 


AUBURN UNIVERSITY 
Department of Computer Science 
and Engineering 

The Department of Computer Science 
and Engineering invites applications for a 
tenure-track faculty position at the Assistant 
or Associate Professor level, beginning Sep¬ 
tember 1992. Responsibilities include re¬ 
search, supervision of (M.S. and Ph.D.) 
graduate student research, and graduate 
and undergraduate teaching. Candidates 
should have a Ph.D. in computer science, 
computer engineering, or a closely related 
field. Although applicants in all areas of com¬ 
puter science and engineering will be con¬ 


sidered, preference will be given to those 
with research specialties in software engi¬ 
neering, computer networks, and computer 

The Department currently has eleven full¬ 
time faculty members and supports strong 
undergraduate and graduate programs. 
Faculty research areas include parallel com¬ 
putation, software engineering, artificial in¬ 
telligence, and computer networks. Depart¬ 
mental resources include a network of Sun 
workstations that is linked to the College of 
Engineering's Sun network. Parallel com¬ 
puting research is supported by the avail¬ 
ability of a network of 16 T800 Transputers 
that are accessible from any of the Depart¬ 
ment's workstations. Network access is also 
available to the general computer facilities of 
the University, and to the Alabama Super¬ 
computer Network’s Cray and nCUBE 
supercomputers. 

Auburn University is located in the city of 
Auburn in east-central Alabama, 100 miles 
southwest of Atlanta on 1-85. The current 
enrollment at this land-grant university is 
over 21,000 students. 

Applicants should send a curriculum vitae 
and the names, addresses, and telephone 
numbers of three references to: 

Professor Stephen B. Seidman 
Head, Department of Computer Science 
and Engineering 

Auburn University, AL 36849-5347 

Questions can be emailed to seidman @ 
cng.auburn.edu. Review of applications will 
begin on February 15, 1992 and continue 
until the position is filled. Auburn University 
is an affirmative action/equal opportunity 
employer; women and minorities are en¬ 
couraged to apply. 


PHILADELPHIA COLLEGE OF 
TEXTILES AND SCIENCE 
Faculty Position Announcement 
Computer Science 

A tenure-track position is available for the 
fall of 1992. Applicants should be en¬ 
thusiastic about teaching a variety of in¬ 
troductory and advanced computer science 
courses, as well as some introductory 
courses in mathematics. An interest in con¬ 
ducting an active research program involving 
undergraduate students is required. Position 
would entail further development of emerg¬ 
ing computer science and applied math¬ 
ematics curricula in conjunction with the 
overall mission of the college, to provide a 
liberal professional education. PCT&S offers 
undergraduate^and graduate programs in 
four academic divisions (Science, Textiles, 
Business, and Humanities). Facilities include 
VAX 8250 minicomputer (VMS), Silicon 
Graphics IRIS workstation (UNIX), and a 
number of microcomputer platforms. Ph.D. 
in computer science or related field required; 
salary and rank commensurate with qualifi¬ 
cations. Applicants should send a letter of 
application, vitae, statement of research ob¬ 
jectives. and the names, addresses and tele¬ 
phone numbers of three references by DATE 
to Office of Academic Affairs, Philadelphia 
College of Textiles and Science, Philadelphia, 
Pennsylvania 19144. Philadelphia College of 
Textiles and Science is an AA/EO institution. 


February 1992 
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OREGON GRADUATE INSTITUTE 

The Oregon Graduate Institute of Science 
& Technology invites applications for a new 
permanent faculty position in the Depart¬ 
ment of Computer Science and Engineering. 
Applicants should have a Ph.D. in computer 
science and 3 or more years of experience in 
research in the area of database systems, 
although exceptional junior candidates will 
be considered. 

OGI is a private research university 
located a few miles west of Portland, 
Oregon. OGI encourages serious scholar¬ 
ship by providing strong administrative sup¬ 
port and excellent research facilities. 
Because OGI offers only graduate degrees, 
faculty have no undergraduate teaching 
responsibilities. The Department currently 
has 12 full-time faculty members. 

It is anticipated that the’successful can¬ 
didate will participate in collaborative 
research with Battelle Pacific Northwest 
Laboratories of Richland, Washington in the 
area of scientific information management. 
Supported by the U.S. Department of 
Energy, PNL is expanding the role of com¬ 
puting research in its scientific mission, par¬ 
ticularly in the fields of environmental and 
molecular sciences. 

To apply, send a brief description of re¬ 
search interests, the names of at least three 
references and a resume to: Professor 
Richard B. Kieburtz, Chairman, Department 
of Computer Science and Engineering, Ore¬ 
gon Graduate Institute of Science & Tech¬ 
nology, 19600 NW von Neumann Drive, 
Beaverton, OR 97006, (503) 690-1150, 
csedept@cse.ogi.edu. 

bGl is an Equal Opportunity Employer. 


CONCORDIA UNIVERSITY 
Department of Electrical and 
Computer Engineering 

The Department of Electrical and Com¬ 
puter Engineering at Concordia University 
invites applications for a tenure track faculty 
position in Computer Engineering at the 
Assistant or Associate Professor level. Ap¬ 
plicants must have a Ph.D. in Electrical 
Engineering, or Computer Engineering or 
Computer Science. The position involves 
teaching at the undergraduate and graduate 
levels and independent research. The 
department offers bachelor's programs in 
Electrical Engineering and Computer 
Engineering and Master's and Ph.D. pro¬ 
grams in Electrical Engineering. We are 
primarily looking for candidates with 
teaching interests and demonstrated 
research expertise in the general area of 
Parallel and Distributed Systems with an 
emphasis on one of the following: Parallel 
Architectures. Parallel and Distributed 
Computing and Computer Systems Perfor¬ 
mances Evaluation. The department has ex¬ 
cellent research facilities and expertise in 
several other areas which include Computer 
Communications. Protocol Engineering, 
Robotics and Control, Computational 
Graph Theory, Neural Computing, VLSI 
and Signal Processing. The faculty members 
of the department have close links with 
several research groups within and outside 


the university: GRIAO, Montreal Inter¬ 
university Research Center on High Perfor¬ 
mance Architectures and VLSI, Centre de 
Recherche Informatique de Montreal 
(CRIM) and MICRONET (National Network 
of Centers of Excellence in VLSI). 

In accordance with Canadian Immigration 
requirements, priority will be given to citizens 
and permanent residents of Canada. Con¬ 
cordia University is committed to employ¬ 
ment equity and encourages applications 
from women, aboriginal people, visible 
minorities and disabled persons. All things 
being equal, women candidates, shall be 
given priority. Applications will be accepted 
until the position is filled. 

Applicants should send a resume and 
names of at least three references to: 

Dr. P.D. Ziogas, Chairperson 
Department of Electrical and Computer 
Engineering 
Concordia University 
1455 de Maisonneuve Blvd. West 
Montreal, Quebec 
Canada 
H3G 1M8 


UNIVERSITY OF VERMONT 
Faculty Positions 

Applications are invited for two tenure- 
track faculty positions in Computer Science 
at the level of assistant professor for the 
1992-93 academic year. Responsibilities will 
include instruction in mainstream computer 
science and the development of a quality 
research program. Candidates should show 
promise of excellence in both teaching and 
research; have demonstrable expertise in 
networks and distributed systems, parallel 
algorithms and systems or database and 
knowledge base systems; and have a strong 
interest in interdisciplinary research in 
mathematics. Faculty are encouraged to 
supervise graduate students in related fields 
as well as in computer science. A doctorate 
in computer science or a closely related field 
is required. 

Applications will be accepted until the 
positions are filled. Please submit resume 
and description of current research interests, 
and arrange to have three letters of recom¬ 
mendation sent directly to Dr. Richard 
Foote, Search Committee Chairperson, 101 
Votey, College of Engineering & Mathema¬ 
tics, University of Vermont, Burlington, VT 
05405. The University of Vermont is an Af¬ 
firmative Action Equal Opportunity Em¬ 
ployer and encourages applications from 
women and members of minority groups. 


WRIGHT STATE UNIVERSITY 
Department of Computer Science 
and Engineering 

Wright State University, Department of 
Computer Science and Engineering invites 
applicants for tenure-track faculty positions 
at all ranks. The successful candidate should 
have a Ph.D. in computer science, computer 
engineering, or equivalent degree, and have 
demonstrated forward looking and creative 
research as evidenced by recent journal 


publications. Further desired attributes in¬ 
clude: capability to direct Ph.D candidates 
and/or direct research projects. Preferred 
technical areas are distributed systems, net¬ 
working, parallel computing, and database, 
but other areas will be considered. 

The university is located in a high technol¬ 
ogy environment among industrial/military 
research and development facilities, in¬ 
cluding Wright Patterson Air Force Base and 
NCR. Department strengths include a fully 
networked Unix environment of Sun & DEC 
workstations; Cray access; and graduate 
laboratories in AI, optical computing, neural 
networks, and robotics; degree programs in 
both computer science and computer engi¬ 
neering; and approximately 150 selected 
graduate students. The department has 25 
faculty, four of whom were awarded NSF 
research initiation awards last year. 

Please submit a detailed resume including 
names of three references to: CSNET ad¬ 
dress - amcaulay@cs.wright.edu or Alastair 
D. McAulay, NCR Distinguished Professor 
and Chair, Department of Computer Sci¬ 
ence and Engineering, Wright State Univer¬ 
sity, Dayton, Ohio 45435. Pending avail¬ 
ability of funding, reviewing for positions will 
begin February 15, 1992 and continue 
monthly until positions are filled or until 
September 1, 1992. 

AN EQUAL OPPORTUNITY/AFFIR¬ 
MATIVE ACTION EMPLOYER. 


RESEARCH ASSOCIATE 

A limited term (2 years) position working 
with a group of researchers on the Touring 
Machine project at Bellcore's Information 
Networking Research Laboratory. The goal 
of the Touring Machine project is to provide 
a general, reliable, and flexible software plat¬ 
form that supports a large class of distributed 
multimedia applications. The experimental 
testbed is composed of a network of desktop 
video and audio devices controlled via users' 
workstations. In addition to being a research 
testbed, the system is used on a daily basis as 
a communications tool by over 100 users at 
two Bellcore locations, with a goal of creating 
a national network experiment with hundreds 
of users. Multimedia conferencing and 
CSCW applications are currently supported. 

The Associate will have responsibility for 
designing and implementing various aspects 
of the continually evolving system, including 
support for the large user community. Can¬ 
didate must have a Masters degree in Com¬ 
puter Science or related field with a mini¬ 
mum of two years’ programming experience 
in C, Unix, and X-Windows. Experience in 
distributed systems and object-oriented pro¬ 
gramming is desirable. 

Job offers the opportunity to participate in 
a major research project. Computing and 
communications facilities are state-of-the- 
art. Compensation and benefits are compet¬ 
itive. Women and minority candidates are 
strongly encouraged to respond. 

To explore this opportunity, send a letter 
of interest and a resume to: Manager, Tech¬ 
nical Employment, Dept. 123/0406/92, 
Bellcore. RRC 4C131, 444 Hoes Lane, Pis- 
cataway, NJ 08854. An equal opportunity 
employer. 
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DEVELOPMENT STAFF MEMBER 

Engage in research and development ef¬ 
forts in the highly complex and sophisticated 
areas of parallel and distributed computing. 
Investigate new ways of parallel and distrib¬ 
uted computing for low-end systems; and 
participate in research and development of a 
multi-processing operating system in a multi¬ 
user Micro Channel based system. 40 hours/ 
week, $60,000/year. Must have Ph.D. in 
Computer Science, one year experience on 
the job or one year experience as a pre or 
post doctoral research assistant. The one 
year related experience must include re¬ 
search and development in object-oriented 
operating systems; real-time operating sys¬ 
tems; operating system for parallel pro¬ 
cessors; and real-time programming lan¬ 
guages as well as use of a wide range of 
technology, including Unix operating system 
internals/device drivers; real-time schedul¬ 
ing techniques; compiler construction 
techniques/tools; parallel processing ar¬ 
chitectures; and virtual memory manage¬ 
ment systems. Apply at the Texas Employ¬ 
ment Commission. Austin, Texas or send 
resume to the Texas Employment Commis¬ 
sion, TEC Building, Austin, Texas 78778, 
J.O. # 64490'21. Ad paid by an equal 
employment opportunity employer. 


INTERNATIONAL COMPUTER 
SCIENCE INSTITUTE 

High-speed Networking Researcher 

The International Computer Science Insti¬ 
tute (1CS1) is seeking an outstanding re¬ 
searcher to lead its efforts in high-speed net¬ 
working. 1CSI is an independent, non-profit 
basic research institute located in Berkeley. 
California, and affiliated with the University 
of California at Berkeley. The Institute strives 
to maintain basic research projects of the 
highest standard in selected areas of com¬ 
puter science and engineering within the 
context of international cooperation. Cur¬ 
rent emphasis is on distributed and parallel 
computation. All of the research at the in¬ 
stitute is open, and its results are made 
publicly available. ICSI is supported by Ger¬ 
man. Swiss, and Italian governmental agen¬ 
cies and industrial organizations, and by a 
variety of U.S. sources: participation by 
other countries is under consideration. 

Networking research at ICSI is conducted 
in strict cooperation with the University of 
California at Berkeley. Current joint projects 
include participations in the BLANCA Test¬ 
bed of the Gigabit Testbed Initiative (using 
the Xunet ATM-based coast-to-coast experi¬ 
mental internetwork), in the Sequoia 2000 
project, which is sponsored by the Digital 
Equipment Corporation and involves collab¬ 
oration among four U.C. campuses, and in 
the initial planning of the San Francisco Bay 
Area Metropolitan Gigabit Network. 

Involvement in International networking 
projects may also be considered in ICSI's 
future plans. Topics being currently in¬ 
vestigated include the design and implemen¬ 
tation of, and experimentation with, real¬ 
time communication protocols, the design, 
evaluation, and use of parametric, network- 


modifiable video compression algorithms, 
the development of flow and congestion 
control mechanisms for high-speed wide- 
area networks, and the analysis of multiplex¬ 
ing policies to be used at the entrances into 
an ATM backbone. 

Applicants should have a Ph.D., a strong 
record of research accomplishments, excel¬ 
lent leadership and organizational qualities, 
and good interpersonal communication skills. 
The appointee will work with ICSI post-docs 
and visiting researchers, and with faculty 
consultants and graduate students from the 
University of California at Berkeley. If the 
appointee is interested, it may be possible to 
arrange for a temporary teaching position at 
the University. Salary will be commensurate 
with experience and qualifications. 

Please send resume, copies of selected 
publications, and the names and addresses 
of three references, to: 

Professor Domenico Ferrari 
International Computer Science Institute 
1947 Center Street, Suite 600 
Berkeley, CA 94704 
U.S.A. 

ICSI is an Equal Opportunity employer. 


COMPUTER SPECIALIST 
$36,536/yr. 

Duties: Support development and main¬ 
tenance of system level software for physics 
computing systems; survey physicists’ needs 
to determine system and software require¬ 
ments; perform analysis and design of new 
software using structured methodology; 
conduct acceptance test generation and 
quality assurance testing; and write 
documentation with documenting systems. 

Requirements: M.S. in Computer Sci- 
ence, with 3 credit hours each in Numerical 
Analysis, Simulation, Software Engineering 
and Computer Network; familiarity with fun¬ 
damental Physics Theory as evidenced by 2 
semesters or 6 credit hours of college-level 
Physics course; and 1 year experience in job 
offered or 1 year experience in system pro¬ 
gramming, involving use of UNIX system 
programming, X-Windows graphics tools 
programming, UNIX networking and FOR¬ 
TRAN and C languages. 

Job Site: Dallas, Texas 

Apply at the Texas Employment Commis¬ 
sion, Dallas, Texas, or send resume to the 
Texas Employment Commission, TEC Build¬ 
ing, Austin, Texas 78778, J.O. *6422245. 
Ad Paid by an Equal Opportunity Employer. 


WAYNE STATE UNIVERSITY 
Chair 

Department of Computer Science 

The Wayne State University Department 
of Computer Science invites applications 
and nominations for the position of Professor 
and Chair. Wayne State University, located 
in Detroit’s Cultural Center, is an urban, 
comprehensive research university serving 
34,000 students. The Department of Com¬ 
puter Science has sixteen faculty members. 


approximately 150 graduate students and 
350 undergraduates. It offers the Ph.D.. 
M.S., M.A., B.S., and B.A. degrees. 

The Department has recently moved to 
new quarters with modern laboratory facili¬ 
ties. Current research strengths lie in the 
areas of database systems, software engi¬ 
neering, artificial intelligence, computer 
vision and image processing, distributed 
systems, neural networks, biocomputing, 
numerical methods and natural language 
processing. The faculty have ties to industry 
(including local automotive and computer 
companies) and to several institutes for high 
technology associated with the University. 

The new Chair will be expected to provide 
strong leadership toward enhancing the na¬ 
tional standing of the Department’s research 
and teaching programs. Candidates must 
exhibit a distinguished research record, a 
commitment to teaching and strong admini¬ 
strative skills. A Ph.D. in Computer Science 
or a related field is expected. Applications 
from minority and women candidates are 
especially encouraged. Letters of applica¬ 
tion, curriculum vitae and names of at least 
three professional references should be sent 

Dr. Lawrence D. Favro 
c/o George Jones 
Office of the Dean 
College of Liberal Arts 
2226 Faculty Administration Building 
Wayne State University 
Detroit. MI 48202 

Review of applications will begin on Feb¬ 
ruary 15. 1992, but applications will be ac¬ 
cepted until the position is filled. 

Wayne State University is an Equal Op¬ 
portunity/Affirmative Action Employer. 


VICE PRESIDENT OF ENGINEERING 
$40,800/yr 

Duties: Overall direction of the technical 
aspects of company’s operation, including: 
planning market research projects and de¬ 
veloping marketing strategies for company’s 
products; supervising computer engineers 
and technicians engaged in sales, analyzing 
technical requirements of customers, identi¬ 
fying manufacturers in Taiwan, conducting 
product evaluation and cost estimation, 
testing, inspection, preparing technical pro¬ 
posals, creating sales promotion tools in¬ 
cluding demonstration programs, foils, users 
manuals, data sheet and company profile, 
and providing installation and maintenance 
services; and^rmulating contract negotia¬ 
tion strategies, supervising negotiation of 
purchase and sale agreements, and ad¬ 
ministering performance of contracts. 

Requirements: B.S. degree in Com¬ 
puter Science; 5 years experience in job of¬ 
fered involving administration of computer 
products in U.S. and Taiwan; and speak 
(Mandarin), read and write Chinese. 

Job Site: Houston, Texas 

Apply at the Texas Employment Commis¬ 
sion, Houston, Texas; OR send resume to 
the Texas Employment Commission, TEC 
Building, Austin, Texas 78778, J.O. 
*6344570. Ad Paid by an Equal Opportuni¬ 
ty Employer. 


February 1992 
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SYSTEMS ANALYST 

Direct, plan, and coordinate data process¬ 
ing activities for firm's PICK operating soft¬ 
ware network system. Analyze, define and 
recommend, computer equipment configu¬ 
rations and modifications to existing equip¬ 
ment and systems that will provide for the 
capability to accomplish proposed projects 
and allow for the efficient operation and ef¬ 
fective use of IBM compatible main frames. 
Study existing data handling systems to eval¬ 
uate effectiveness and develop new systems 
as required. Conduct special studies and in¬ 
vestigations pertaining to the development 
of new information systems to meet current 
as well as projected needs of company. Use 
PICK and Basic programming languages to 
program simple problems and clearly de¬ 
fined segments of large complex programs 
and assist with tests and in making revisions 
to eliminate errors. Implement PICK soft¬ 
ware package and interface new hardware 
as they are acquired. Requires a Master’s 
degree in MIS and one year experience in 
job offered or one year data processing pro¬ 
gramming systems design experience. Must 
have one year experience in each of the 
following: DBase III; Systems analysis and 
design; project management; statistical an¬ 
alysis and application experience; financial 
systems and computer simulation experi¬ 
ence. Must have nine credit hours in the fob 
lowing: telecommunications network design. 
40 hour work week. $2,505.00 per month. 
Apply at the Texas Employment Commis¬ 
sion, Dallas, Texas, or send resume to the 
Texas Employment Commission, TEC Build¬ 
ing. Austin, Texas 78778. Job Order 
'>'6422282. Ad paid by an Equal Opportuni¬ 
ty Employer. 


FLORIDA ATLANTIC UNIVERSITY 

Computer Science and Engineering 

The Department of Computer Science 
and Engineering, in the College of Engineer¬ 
ing, is seeking applicants and nominations 
for faculty positions at all levels. A doctorate 
in computer science, computer engineering, 
or a closely related field is required. 

A junior-level position requires demon¬ 
strated teaching ability and research poten¬ 
tial. A senior-level position requires a record 
of good teaching performance and high- 
quality research, with funded research 
especially desirable. While applicants with 
expertise in software engineering, artificial 
intelligence, microprocessor systems, and 
computer networks are preferred, all special¬ 
ities will be considered. Salaries, fringe 
benefits, and teaching loads are competitive. 
The appointments will begin in August 
1992; the closing date for applications is 
March 1, 1992, although applications will be 
reviewed until suitable candidates are found. 

Florida Atlantic University, which is a 
member of Florida's State University Sys¬ 
tem, is located in Boca Raton, on the Atlan¬ 
tic Ocean, midway between West Palm 
Beach and Fort Lauderdale. Many high-tech 
companies are located in this growing area, 
and opportunities for interaction with in¬ 
dustry are excellent. 


The Department currently has twenty-three 
regular faculty, with other visiting and 
research faculty normally in residence. 
Faculty are provided-workstations with net¬ 
work access to departmental, university, and 
international computer networks. Other 
research equipment may be acquired as 
necessary. 

The Department offers a CSAC-accredited 
bachelor’s program in computer science, 
master’s programs in computer science and 
in computer engineering, and doctoral pro¬ 
grams in computer science and in computer 
engineering; a bachelor’s program in com¬ 
puter engineering is in preparation. Over 
300 undergraduate, 200 master’s students, 
and 30 doctoral students are enrolled in the 
Department’s degree programs. The Univer¬ 
sity has over 14,000 students and offers a 
variety of bachelor’s, master’s, and doctoral 
programs. 

Applicants should send a resume, includ¬ 
ing the names and phone numbers of three 
professional references, along with a cover 
letter specifying the professional rank 
desired, to Faculty Search Committee, De¬ 
partment of Computer Science and Engi¬ 
neering, Florida Atlantic University, Boca 
Raton, FL 33431. Electronic mail communi¬ 
cations should be addressed to searchcomm 
@cse. fau.edu. 

Florida Atlantic University is an equal op¬ 
portunity/ affirmative action employer. Mem¬ 
bers of protected classes are encouraged to 
apply. 


McGILL UNIVERSITY 
Electrical and Computer Engineering 

The Department of Electrical Engineering 
of McGill University has a tenure-track open¬ 
ing at the assistant professor level in the area 
of computer engineering, which it would like 
to fill not later than September 1992. Ap¬ 
plications are invited from individuals who 
are dedicated to teaching, and who have 
outstanding research potential and demon¬ 
strated research achievements. Preference 
will be given to candidates with post-doctoral 
experience in software engineering. 

Candidates must have an earned Ph.D. 
degree, while a first degree in electrical 
engineering is desirable. Please send a 
resume and a list of 3 references to Professor 
Nicholas C. Rumin, Chairman, Department 
of Electrical Engineering, McGill University, 
3480 University St.. Montreal, Quebec, 
H3A 2A7. 

In accordance with Canadian Immigration 
requirements, this advertisement is directed 
in the first instance to Canadian citizens and 
Permanent Residents of Canada. 


COLORADO STATE UNIVERSITY 
Faculty Positions. Fall 1992 
Computer Science Department 

The Computer Science Department solic¬ 
its applications for tenure-track and visiting 
faculty positions at all levels (subject to fund¬ 
ing) . Candidates for assistant professor need 
a Ph.D. in computer science (at time of ap¬ 


pointment) with promise for excellence in 
research and teaching; applicants for senior 
ranks must possess distinguished research 
records. The Department has approval for 
significant growth over the next few years, 
and has identified selected areas in parallel 
computing, artificial intelligence, and soft¬ 
ware engineering for special attention. 
Salary is commensurate with rank and ex¬ 
perience. New and visiting faculty will enjoy 
duties especially conducive to productive 
research. 

The Department offers B.S., M.S., and 
Ph.D. degrees. We have excellent coopera¬ 
tive research relations with industrial and 
government laboratories, and their people 
form a significant portion of our graduate stu¬ 
dent population. We operate numerous 
multi-user systems (HP, DEC, Sequent) and 
many workstations (HP, IBM. Sun, Apple, 
ATT), all networked. University operations 
include IBM RS6000 servers and a visualiza¬ 
tion laboratory. Department personnel work 
is a pleasant, smoke-free environment. 

Fort Collins is a growing community of 
92.000 located along the foothills of the 
Rocky Mountains. 60 miles north of Denver. 
The climate is moderate—about 15 inches of 
precipitation and 290 days of sunshine per 
year. There are many cultural opportunities 
and year-round outdoor activities. 

Send your curriculum vitae and names of 
at least three professional references to: Dr. 
R.R. Oldehoeft, Search Committee, Com¬ 
puter Science Department. Colorado State 
University, Fort Collins. CO 80523. Appli¬ 
cations for August, 1992 will be considered 
March 1, 1992. The search may be extended 
if suitable candidates are not found. 

Colorado State University is an EEO/AA 
employer. EO Office: 21 Spruce Hall. 


UNIVERSITY OF FLORIDA 
Faculty Positions in 
Software Engineering 

The Computer and Information Sciences 
Department invites applications for tenured 
positions at senior and junior levels in the 
area of software engineering. One of the 
positions will have the assignment as the 
director of the Software Engineering Re¬ 
search Center in the NSF industry/University 
Cooperative Research Centers Program. 
Applicants must possess a doctoral degree in 
computer science or equivalence and show a 
strong record/commitment to teaching and 
research in software engineering and related 
areas. Some administrative and/or industrial 
experience for senior level candidates is 
desirable. The positions are available starting 
the 1992-1993 academic year or earlier. 

Applicants should send their resumes and 
the names and addresses of four references 
to Prof. Randy Chow. Chairman, Faculty 
Search and Screening Committee, Compu¬ 
ter and Information Sciences Department, 
301 CSE, University of Florida, Gainesville. 
FL 32611, phone: (904) 392-1200, e-mail: 
chow@cis.ufl.edu. 

Closing date: March 1st, 1992 or until the 
positions are filled. University of Florida is 
an equal opportunity/affirmative action em¬ 
ployer. This faculty search will be conducted 
in compliance with Florida’s “Government in 
the Sunshine Law." 
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THE FOLLOWING 
INFORMATION IS AVAILABLE: 


PUBLICATIONS AND 
ACTIVITIES 


Contact the Publications Office; 
to facilitate handling, please request by number. 

• Membership application, student #203, others #202 

• Periodicals subscription form for individuals #200 

• Periodicals subscription form for organizations #199 

• Publications catalog #201 

• Compmail electronic mail brochure #194 

• Technical committee list/application #197 

• Chapters lists, start-up procedures #193 

• Student scholarship information #192 

• Volunteer leaders/staff directory #196 

• IEEE senior member grade application #204 
(requires ten years practice and significant performance in five of those ten) 

To check membership status or reportachange of address, call the IEEE toll-free number, 
1 -800-678-4333.Direct all other Computer Society related questions to the Publ ications 
Office. 



The IEEE Computer Society advances the theory and practice of computer science and 
engineering, promotes the exchange of technical information among 100,000 members 
worldwide, and provides a wide range of services to members and nonmembers. 


Members receive the acclaimed monthly magazine Computer, discounts, and opportu¬ 
nities to serve (all activities are led by volunteer members). Membership is open to all IEEE 
members, affiliate society members, and others interested in the computer field. 


Computer. An authoritative, easy-to-read magazine 
containing tutorial and in-depth articles on topics across the 
computer field, plus news, conferences, calendar, interviews, 
and product reviews. 

Periodicals. The society publishes seven magazines 
and five research transactions. Refer to membership applica¬ 
tion or request information as noted above. 

Conference Proceedings, Tutorial 
Texts, Standard Documents. The Com¬ 
puter Society Press publishes more than 100 titles every year. 
Standards Working Groups. Over 100 of these groups produce IEEE 
standards used throughout the industrial world. 

Technical Committees. Morethan30TCs publish newsletters, provide 
interaction with peers in specialty areas, and directly influence standards, conferences, 
and education. 

Conferences/Education. The society holds about 100 conferences 
each year and sponsors many educational activities, including computing science 
accreditation. 

Chapters. Regular and student chapters worldwide provide the opportunity to 
interact with colleagues, hear technical experts, and serve the local professional 
community. 


Members experiencing problems — magazine delivery, membership status, or 
unresolved complaints — may write to the ombudsman at the Publications Office. 
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BOOK REVIEWS 


Editor: Alan Kaminsky, Rochester Institute of Technology, PO Box 9887, Rochester, NY 14623-0887 
(716) 475-5255; Internet ark@cs.rit.edu;Bitnet arkics@ritvax. 


Introduction to Parallel Algorithms and Architectures: 

Arrays, Trees, Hypercubes 

F. Thomson Leighton (Morgan Kaufmann Publishers, San Mateo, Calif., 1992, ISBN 1-55860-117-1, 831 pp., $54.95) 


The field of computing is much larg¬ 
er and noisier in 1992 than it was in 
1968, the year that volume one of 
Knuth’s The Art of Computer Pro¬ 
gramming first appeared. Certainly, it 
is more difficult now for one author to 
have an impact equal to Knuth’s in 
the late sixties and early seventies. 
However, after finishing Leighton’s 
book, it is hard to escape the feeling 
that one has just read an important 
document in the field — particularly if 
one believes that by the end of this 
decade, “computing” will be synony¬ 
mous with “parallel computing.” 

More than 800 pages long, this text 
clearly aspires to encyclopedic cover¬ 
age. This impression is reinforced by 
the fact that a second volume is in 
preparation. However, Leighton’s 
style is far from the dense, dry discus¬ 
sions that characterize so many ency¬ 
clopedias. Instead, it is lively and 
readable, continually connecting top¬ 
ics and themes in a way that enriches 
the text. Equally important, particu¬ 
larly in a book this size, Leighton pre¬ 
sents complex material in a way that 
lets readers understand the gist of his 
arguments even when they do not 
have the time or inclination to work 
through the details. 

Do not misunderstand the last com¬ 
ment. This is not a book for the faint¬ 
hearted or for one who just wants an 
overview of the state of parallel com¬ 
puting today. Rather, it is a deep and 
mathematically sophisticated analysis 


of parallel algorithms; paper and pen¬ 
cil are necessary adjuncts to reading 
the text. 

It is divided into three long chap¬ 
ters: Arrays and Trees, Meshes of 
Trees, and Hypercubes and Related 
Networks. The focus is on parallel dis¬ 
tributed-memory systems, with con¬ 
nection topologies as specified in 
these chapter titles. The architectural 
discussions are formal, focusing on 
logical rather than physical architec¬ 
ture. In this sense, the presentation is 
geared more to programmers than en¬ 
gineers. Leighton provides enough de¬ 
tail to support careful analysis of such 
issues as speedup and efficiency. 

He includes algorithms for many 
standard computing problems — sort¬ 
ing, counting, integer arithmetic, ma¬ 
trix algebra, etc. — but avoids the im¬ 
pression of a catalog by introducing 
them in an evolutionary style, using 
the analysis of one approach to sug¬ 
gest another or showing how an archi¬ 
tectural modification can improve an 
algorithm’s efficiency. He also in¬ 
cludes interesting discussions of issues 
such as packet routing. 

His presentation is visually orient¬ 
ed. He creates strong images of ser¬ 
ried ranks of data marching through 
networks of processors and combining 
in intricate patterns. The beauty and 
geometric symmetry of many parallel 
algorithms is revealed. Nowhere is an 
algorithm presented in pseudocoded 
form. The author relies on diagrams 


and clear expository style, and in most 
cases this is more than adequate. One 
could argue that this makes the book 
less useful to the programmer looking 
for an algorithm to solve a given prob¬ 
lem. However, given the myriad paral¬ 
lel architectures currently available 
and the lack of agreement on a para¬ 
digm for describing parallel algo¬ 
rithms, Leighton’s approach is proba¬ 
bly the only one that guarantees some 
universality. 

The book includes over 750 exercis¬ 
es, classified in various ways. To fur¬ 
ther ease the task of selecting repre¬ 
sentative problems from such a large 
pool, the author marks the most 
worthwhile problems (about one third 
of them or, as he states, one problem 
for every three pages of reading). This 
structure is so reminiscent of Knuth 
that it makes one feel as if Knuth’s 
work is the scale against which Leight¬ 
on wants his work to be measured. 

This is an important book: compre¬ 
hensive, thought-provoking, and — as 
far as a book on this topic can be — 
enjoyable to read. It was designed as a 
text for an introductory course on par¬ 
allel algorithms. It is clearly more than 
that, and its size might even discourage 
choosing it as a text. However, this is 
the kind of book a student will keep as 
a source of ideas and techniques long 
after the course is over. 

Andrew Kitchen 

Rochester Institute of Technology 


A Theory of Computer Semiotics 

P.B. Anderson (Cambridge University Press, Cambridge, UK, 1990, ISBN 0-521-39336-1, 416 pp., $59.50) 


“In my professional work as a lin¬ 
guist, I have been obsessed with two 
topics: computers and the way people 
talk when they work.” Thus P.B. 
Anderson sets the scene for a book 
that lays some new foundations for 
computer science research by showing 
how and why to use semiotic theory in 
the study of human-computer interac¬ 
tion (HCI). This book can and should 


be used by researchers and developers 
of computer software everywhere as 
an antidote to their education and a 
bracing methodological tonic. 

Semiotics is the study of signs and 
symbols. Growing out of linguistics, it 
was an academic fad of the late 1970s 
and early 1980s, especially in Europe 
and particularly in epistemology, liter¬ 
ary criticism, and human psychology. 


Semiotics leads to a notion of lan¬ 
guage as the creation of meaning in a 
social process that achieves a consen¬ 
sus. It is an alternative to the tradi¬ 
tional mathematical frameworks used 
in computer science. I have felt a need 
for such an alternate theoretical foun¬ 
dation in computer science ever since 
asking a number-theorist colleague 
for his concept of semantics. “It’s sim- 
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pie,” he replied. "There are two val¬ 
ues: true and false.” As a scientist 
with a background in humanities and 
clinical psychology, I wasn’t con¬ 
vinced. 

Anderson’s tack is humanistic and 
hermeneutic. He reconsiders the “ma- 
chine-as-human” metaphor that has 
underpinned most AI research and re¬ 
places it with a “machine-as-medium” 
metaphor — more appropriate to a 
field fundamentally concerned with 
communication. He also rejects the 
practice of borrowing calculi from 
other sciences such as transformation¬ 
al grammar and predicate calculus. In¬ 
stead he extends and adapts structur¬ 
alist linguistic methods, which 
construct algebras from the analysis of 
existing texts. 

Anderson points out the difference 
between the study of language usage 
or particular language users and the 
study of language itself, which preex¬ 
ists and survives individual users. He 
sees an analogy between language us¬ 
age and our use of computers and 
considers linguistic methods particu¬ 
larly appropriate in HCI research: 
“What immanent semiotics can offer 
HCI is precisely a discourse for under¬ 
standing the nonindividual, contractu¬ 
al aspects of an interface and for de¬ 
scribing the interface (and other parts 


When I fell into the real-time sys¬ 
tems business some 20 years ago, I 
dreamed of solving any kind of prob¬ 
lem simply by getting another PDP- 
11, programming it to catch another 
interrupt — possibly in Fortran, and 
delivering it to the customer in the 
glory of what computers can do. 

Nowadays, we need better theoreti¬ 
cal support for making optimal design 
and implementation decisions. This 
book addresses this issue. Its major 
paradigm for real-time system design 
is control theory applied to discrete 
event processes. Starting from the de¬ 
scription of an observable plant be¬ 
havior (“plant” means an object to be 
controlled), a design problem is de¬ 
fined as finding a computer program 
(or a controller, in the language of 
control theory) to achieve the desired 
behavior. Verification is a related 
problem and is formulated as follows: 
Given the plant and controller behav¬ 
ior, demonstrate that the required 
plant behavior is actually achieved. 

To solve both problems, the author 
uses a theoretical framework of ex¬ 
tended state machines (ESMs) to de¬ 


of the system) in a direct, nonmeta- 
phorical way.” 

By getting away from the notion of 
the HCI as a model for something 
else, like a desktop or a window, we 
can reveal many relations and opera¬ 
tions that are currently concealed by 
metaphor. Anderson shows in detail 
how metaphor and analogy are used 
and thus breaks down any illusions 
about their power. This is particularly 
useful to the interface designer, who is 
engaged in a constant effort to bal¬ 
ance truth and illusion, mediating be¬ 
tween what the computer program 
can do and what the computer user 
wants to achieve. 

The book questions our assump¬ 
tions about computer programs right 
down to the level of epistemology: 
Anderson takes on Winograd and 
Flores’ assumption that “cognition 
rests on the manipulation of symbolic 
representations that can be under¬ 
stood as referring to objects and prop¬ 
erties in the world.” Anderson be¬ 
lieves that “the description of the 
program as representing something is 
a description by some observer after 
the fact, rather than by its designer.” 
This view resonates with my experi¬ 
ence as a practical engineer, where I 
have seen programs put to many ef¬ 
fective uses never intended by their 


scribe the plant and controller behav¬ 
ior. Within the single formalism, 
known as the representation language, 
that describes this behavior it is diffi¬ 
cult to prove the correctness of ESM 
descriptions. The author therefore 
uses another language — a real-time 
temporal logic (RTTL) — to reason 
about the correctness of the required 
plant behavior. This theory extends 
the Manna and Pnueli temporal logic 
for dealing with systems whose cor¬ 
rectness depends on both the logical 
result of the system behavior and the 
time at which the results are pro¬ 
duced. 

Both problems can now be formulat¬ 
ed in the new theoretical framework: 

(1) Design problem — Given an 
RTTL specification S for the ESM plant, 
is there an automatic way to produce an 
ESM controller so that the closed-loop 
ESM plant and controller satisfy 5? 

(2) Verification problem—Given the 
ESM plant and controller, do the plant 
and controller (the closed-loop compo¬ 
sition of plant and controller) satisfy 5? 


designers. Anderson sees a program 
as a medium for the evolution of a 
function rather than as a fully speci¬ 
fied and static mechanism. 

The description of his methodology 
is lucid. He sees the HCI as a medium 
constructed from “computer-based 
signs” with a set of features. Handling 
features are produced by the user; 
they articulate user actions (key¬ 
strokes, mousing). Permanent and 
transient features are characteristics 
of the screen; they articulate the parts 
of a system and its transformations. 
Anderson uses these basic units of 
analysis to construct “system texts” 
consisting of computer-based signs. 
Such analysis predicts dependencies 
and other relationships between com¬ 
puter-based signs and the meaning 
they generate, which can be used as a 
basis for synthesis of new action se¬ 
quences and new HCI designs. 

The one thing to criticize about this 
book is the opaque terminology inher¬ 
ited with linguistic theory. If you can 
hack the code, it will introduce you to 
a new way of thinking about programs 
and how people use them. 


Vivienne S. Begg 
Xerox Advanced Information 
Technology 


224 pp., $74.95) 

The author presents the proof sys¬ 
tem for reasoning about both prob¬ 
lems and a procedure for designing 
and verifying a controller. Then he 
gives two interesting and rather well 
known examples, one related to con¬ 
trol of the train gate system at a rail 
way crossing and the other related to 
the track that two trains share. Since 
there are no guarantees that the speci¬ 
fication can be proved, the author 
presents additional rules, called deci¬ 
sion procedures, which form an im¬ 
portant part of the book. 

The book is addressed to software 
and control engineers, as well as seri¬ 
ous students of computer science and 
control theory, but it requires a good 
theoretical background in both areas. 
Although temporal logic for real-time 
systems is still commercially imma¬ 
ture, this book is a promising step in 
the direction of investigating and cre¬ 
ating a formal ground for real-time 
systems applications. 


Janusz Zalewski 

Southwest Texas State University 


Temporal Logic for Real-Time Systems 

Jonathan Ostroff (John Wiley and Sons, New York, 1990, ISBN 0-471-92402-4, 
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“Any clod 

CHANNEL 


have the facts, but having opinions is an art." 

Charles McCabe, San Francisco Chronicle 


Editor: Will Tracz, IBM, Federal Sector Division, Mail Drop 0210, Owego, NY 13827 
e-mail tracz@owgvmO.vnet.ibm.com 


Looking for the new challenge 


Software developers are used to de¬ 
livering or buying software that, by 
default, is not guaranteed to work. 
(Read any software license. Would you 
buy a car with that type of guarantee?) 

Current approaches to computer sci¬ 
ence and programming resemble Mid¬ 
dle Ages alchemy. They are faddish, 
changing with any new gurus, trends, 
or fashions that come along. They op¬ 
erate on concepts and definitions with¬ 
out unified classification. The search 
for something that will turn base met¬ 
als into gold goes on, although the peri¬ 
odic table has not yet been considered. 
The newest fad is OOPS — Object- 
Oriented Pieces of Something. 

Modern computer-science age an¬ 
nouncements and “discoveries” are 
similar to the patterns of Middle Age 
research and developments. All that 
has changed is the wording: Instead of 
saying “Take a spoon of spider eggs 
and you will be healed,” we now say, 
“The newest OOPS-WOOPS-BOOPS 
will improve your software and make 
your software backlog vanish.” 

We work hard to get applications 
running, but what we deliver is soft¬ 
ware that, immediately after installa¬ 
tion, requires enhancements, mainte¬ 
nance, etc. The word maintenance 
doesn’t mean the removal of software 
defects (bugs); it means modification 
of code, because requirements have 
changed. The only explanation is: It 
must be this way, because no one can 
foresee the future needs of users. 

The reality of complex software sys¬ 
tems is sad: Programmers and applica¬ 
tion designers spend more time main¬ 
taining and enhancing programs than 
creating solutions — a waste of hu¬ 
man potential. 

Assorted observations. Consider 
the following observations: 

• Software systems are becoming 
larger and more complex. They still 
require time-consuming and costly 
changes and adaptations. Users have 
had to accept these changes as a ne¬ 
cessity of software maintenance. 

• New hardware products are faster, 
more powerful, and more efficient. 
They urgently need new software ap¬ 


plications that differ essentially from 
current developments. New solutions 
must be flexible. They must not burden 
their creators with permanent mainte¬ 
nance of running programs. Software 
system users must be able to change 
the behavior of their programs imme¬ 
diately, without programmers or pro¬ 
gramming. Modern software must be 
able to adapt dynamically to new envi¬ 
ronments during operation. Traditional 
adjustments can be too slow if program 
adaptations require editing the source 
code, accessing data dictionaries and 
repositories, and using class browsers, 
compilers, and linkers. New program 
errors can slow down user operations. 

• Modern languages and object-ori¬ 
ented methods do not provide solu¬ 
tions to this class of problems. They 
merely provide a possibility for faster 
implementations of changes or faster 
program writing — efficiently creat¬ 
ing spectacular “quick fixes.” Pro¬ 
gramming specialists are needed to 
make changes in large applications as 
soon as users say, “But I meant some¬ 
thing else.” 

• The newest programming tech¬ 
niques do not approach this class of 
problems. We still talk about the lan¬ 
guages and methodologies that pro¬ 
vide faster and more comfortable pro¬ 
gram writing. 

• The topic of software systems that 
could run without software mainte¬ 
nance seems nonexistent. 

• The future of software develop¬ 
ment and programming lies in deliver¬ 
ing software that can run for years 
without needing code changes, even if 
new requirements arise. Managers 
must be able to introduce new busi¬ 
ness practices without having them 
specially programmed. Auditors 
should be able to adjust ledger sys¬ 
tems to new laws without having to 
ask IS to do it for them. The prepara¬ 
tion of specifications, and system and 
program design of such applications 
must be essentially different. 

Many of you might say that if these 
proposals are enacted, programmers 
will be put out of work. I disagree. I 
do not think programmers like to 
maintain programs written and rewrit¬ 


ten by many others. I cannot believe 
that anybody likes to modify an oper¬ 
ating, three-year-old program because 
laws have changed or because the new 
boss wants a new layout of the stan¬ 
dard invoice forms. 

The current backlog cannot be pared 
with existing methods and principles. 
User sophistication grows so fast that 
even if we program faster, the backlog 
may increase. Simultaneously, we can¬ 
not fully use our creativity when such a 
backlog exists. The result is a typical 
catch-22 situation. 

New challenge. The next software 
challenge involves the development of 
applications that do not need code 
maintenance. 

Try this definition: A software sys¬ 
tem or an application is supported by 
a zero-maintenance system when its 
users can change and enhance the be¬ 
havior of their programs without hav¬ 
ing to modify the source code (that is, 
without programming). In other 
words, users can tailor or customize 
the behavior of their software without 
help from computer specialists and 
without programming. 

This may sound like a radical de¬ 
parture from conventional software, 
but ironically there are already a few 
running application systems that use 
the ideas I’ve mentioned. The impact 
of zero maintenance has been revolu¬ 
tionary for the involved users, soft¬ 
ware designers, programmers, and 
those financing the projects. Unfortu¬ 
nately, academia isn’t interested. Pro¬ 
fessors and scientists typically ask, 
“Who ever heard of zero-maintenance 
software?” This concept flies in the 
face of conventional wisdom. 

Can you imagine users evolving their 
own business concepts and rules “on 
the fly,” confident that their computer 
systems will support the new ideas? 

Can you imagine the competitive 
edge for an insurance company or 
bank that could introduce new prod¬ 
ucts ad hoc, without adding to or 
modifying the program code? 

Can you imagine the frontiers that 
could open for science? 

Peter Ligezinski, Vienna, Austria 
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You want the power of a 
standard development OS. But 
you also want the performance of 
a realtime OS. Which way do you 
turn? 

Presenting QNX 4.0. The 
operating system that’s responsive 
enough for realtime apps, small 
enough for PC platforms, flexible 
enough for transparent 
networking, and modular enough 
for the most demanding 
configurations. 

POSIX Means Portable 

Operating systems come in 
more flavors than ice cream. 

Which is why IEEE’s POSIX 
standard is now such an important 
safeguard of portability. 

QNX is a high-performance 
realtime OS with its own unique 
microkernel architecture.* But its 
API is based on the latest POSIX 
standards, so you get both 
outstanding performance and 
portability for all your apps. 


Performance At Run Time, 

Design Time, All the Time 

Only QNX combines the 
performance of a dedicated 
realtime executive with the time¬ 
saving benefits of a rich 
development environment— 
including a host of utilities, an 
award-winning C compiler, and an 
optional OPEN LOOK® GUI 
package. 

QNX Is Distributed 

The QNX operating system 
lets you extend the limits of any 
one microprocessor. Whether 
you’re running a network of four or 
400 machines, QNX makes it all 
feel like a single computer. 



Interprocess communication 
is network-wide, so every process 
can transparently access every 
resource—programs, files, 
devices, even CPUs—anywhere on 
the network. And you can set up 
your network using any mix of 
Intel-based PCs. 

Responsive Tech Support 

QNX’s support hotline can put 
you in direct contact with the 
Technical Development team itself. 
And you’ll have access to our 
24-hour online conferencing and 
update system, where the 
response time to your questions is 
almost like real time. 

“If Only...” 

If you think you have to 
choose between a standard 
development OS and a high- 
performance realtime OS, think 
again. 

Now you can have it both ways. 


Quantum Software Systems Ltd. 


For more information, phone 
1 - 800 - 363 - 9001 . 

■ 175 Terrence Matthews Crescent ■ Kanata, Ontario, Canada ■ K2M 1W8 ■ 1-613-591-0931 

P P -QNX is not based on UNIX source code. 

emark of Quantum Software Systems Ltd. UNIX and OPEN LOOK are registered trademarks of UNIX System Laboratories, Inc. 
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